Training in progress, step 10

Files changed (5)

README.md +1 -1
model.safetensors +1 -1
runs/Aug21_15-31-08_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755783070.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.695.0 +2 -2
runs/Aug21_15-49-00_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755784144.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.950.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -27,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/z5i5c04r)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/davanstrien/huggingface/runs/p4e9o9sx)
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:34b1d747f22ccf86f78f0a208159ba0fbb0c781d759682053bf069ef7216b63e
+oid sha256:75ef37464a4717eabcf2c59dde5ab84d2d57b1e3e1f1e05670116e560bd3794f
 size 4493654912

runs/Aug21_15-31-08_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755783070.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.695.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1c30e4935465a3da85e86312bbc72b739d01a177dee6374a4e2751426bd5f921
-size 20823
+oid sha256:a86e82b97dff19922625a17699ad6c22cc4feb2cd02502aebde604e69091a8a6
+size 22413

runs/Aug21_15-49-00_r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x/events.out.tfevents.1755784144.r-davanstrien-jupyterlab-35e247rd-48eb2-uvu9x.950.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2e9818d86fe7beead22797e1510e89cc251c5e047c55e4a055be00a190f127c9
+size 11282

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:94c511eed230e54a76834e91875f44d11e37797c6c622143cb1d00178c169a1b
+oid sha256:5ca0b3638974540b10c0644b5fcda76749b0268c69e29471de42bf80211b4171
 size 7057