davanstrien HF Staff commited on
Commit
d74a3a5
·
verified ·
1 Parent(s): a80f5e3

Training in progress, step 10

Browse files
README.md CHANGED
@@ -27,7 +27,7 @@ print(output["generated_text"])
27
 
28
  ## Training procedure
29
 
30
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/z5i5c04r)
31
 
32
 
33
  This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
 
27
 
28
  ## Training procedure
29
 
30
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/p4e9o9sx)
31
 
32
 
33
  This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34b1d747f22ccf86f78f0a208159ba0fbb0c781d759682053bf069ef7216b63e
3
  size 4493654912
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75ef37464a4717eabcf2c59dde5ab84d2d57b1e3e1f1e05670116e560bd3794f
3
  size 4493654912
runs/Aug21_15-31-08_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755783070.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.695.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1c30e4935465a3da85e86312bbc72b739d01a177dee6374a4e2751426bd5f921
3
- size 20823
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a86e82b97dff19922625a17699ad6c22cc4feb2cd02502aebde604e69091a8a6
3
+ size 22413
runs/Aug21_15-49-00_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755784144.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.950.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2e9818d86fe7beead22797e1510e89cc251c5e047c55e4a055be00a190f127c9
3
+ size 11282
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:94c511eed230e54a76834e91875f44d11e37797c6c622143cb1d00178c169a1b
3
  size 7057
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5ca0b3638974540b10c0644b5fcda76749b0268c69e29471de42bf80211b4171
3
  size 7057