radm
/

Qwen2.5-32B-simpo-FP8

Model card Files Files and versions

radm commited on Apr 28

Commit

cc64ac9

·

verified ·

1 Parent(s): 5980fe8

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -27,7 +27,9 @@ datasets:
 Improved quality on hard tasks by 25 percent relative to the base model Qwen2.5-32B-Instruct. Improved multilingual support.
-Fine-tuning on A100 in 4-bit with unsloth using SIMPO and internal dataset
 ### Eval results

 Improved quality on hard tasks by 25 percent relative to the base model Qwen2.5-32B-Instruct. Improved multilingual support.
+Fine-tuning on A100 in 4-bit with unsloth using SIMPO and custom dataset
+LoRA adapter: [radm/Qwen2.5-32B-simpo-LoRA](https://huggingface.co/radm/Qwen2.5-32B-simpo-LoRA)
 ### Eval results