File size: 793 Bytes
5980fe8 cc64ac9 5980fe8 d924b0e | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 | ---
base_model:
- Qwen/Qwen2.5-32B-Instruct
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
datasets:
- IlyaGusev/saiga_preferences
- 40umov/dostoevsky
- Vikhrmodels/gutenpromax
---
# Model Card for radm/Qwen2.5-32B-simpo-FP8
## Model Details
Improved quality on hard tasks by 25 percent relative to the base model Qwen2.5-32B-Instruct. Improved multilingual support.
Fine-tuning on A100 in 4-bit with unsloth using SIMPO and custom dataset
LoRA adapter: [radm/Qwen2.5-32B-simpo-LoRA](https://huggingface.co/radm/Qwen2.5-32B-simpo-LoRA)
### Eval results
Eval results on [ZebraLogic](https://github.com/WildEval/ZeroEval)
 |