lebar-mj commited on
Commit
7c370df
·
verified ·
1 Parent(s): bceaaa3

Training in progress, step 250

Browse files
README.md CHANGED
@@ -27,7 +27,7 @@ print(output["generated_text"])
27
 
28
  ## Training procedure
29
 
30
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/mlebar-university-of-chicago/huggingface/runs/hexno4qp)
31
 
32
 
33
  This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
 
27
 
28
  ## Training procedure
29
 
30
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/mlebar-university-of-chicago/huggingface/runs/3zie9zvw)
31
 
32
 
33
  This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
adapter_config.json CHANGED
@@ -27,8 +27,8 @@
27
  "target_modules": [
28
  "o_proj",
29
  "v_proj",
30
- "q_proj",
31
- "k_proj"
32
  ],
33
  "target_parameters": null,
34
  "task_type": "CAUSAL_LM",
 
27
  "target_modules": [
28
  "o_proj",
29
  "v_proj",
30
+ "k_proj",
31
+ "q_proj"
32
  ],
33
  "target_parameters": null,
34
  "task_type": "CAUSAL_LM",
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f4bb726ad95629766da4b92ab26a116207292cb43ccc728aeeda61b41afc0b2
3
  size 54560368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fcd3c7f772fc675324673de02a8a30013df18a86fde68f5491c6988172241372
3
  size 54560368
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:807e78b8e9e08713a031953bbaa8e5b43797919f9435c6f627e06f0fb118d97c
3
  size 6993
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:db73dc0d431d7f1a9ef9e5523050776d01882c73d5f3e7fbedabb0ebd86cca38
3
  size 6993