Training in progress, step 10
Browse files- README.md +1 -1
- model.safetensors +1 -1
- runs/Aug21_15-31-08_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755783070.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.695.0 +2 -2
- runs/Aug21_15-49-00_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755784144.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.950.0 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -27,7 +27,7 @@ print(output["generated_text"])
|
|
27 |
|
28 |
## Training procedure
|
29 |
|
30 |
-
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/
|
31 |
|
32 |
|
33 |
This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
|
|
|
27 |
|
28 |
## Training procedure
|
29 |
|
30 |
+
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/p4e9o9sx)
|
31 |
|
32 |
|
33 |
This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4493654912
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:75ef37464a4717eabcf2c59dde5ab84d2d57b1e3e1f1e05670116e560bd3794f
|
3 |
size 4493654912
|
runs/Aug21_15-31-08_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755783070.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.695.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a86e82b97dff19922625a17699ad6c22cc4feb2cd02502aebde604e69091a8a6
|
3 |
+
size 22413
|
runs/Aug21_15-49-00_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755784144.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.950.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2e9818d86fe7beead22797e1510e89cc251c5e047c55e4a055be00a190f127c9
|
3 |
+
size 11282
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 7057
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5ca0b3638974540b10c0644b5fcda76749b0268c69e29471de42bf80211b4171
|
3 |
size 7057
|