mikasenghaas's picture
Update README.md
1f4a685 verified
|
raw
history blame
553 Bytes
---
library_name: transformers
license: mit
datasets:
- PrimeIntellect/Reverse-Text-SFT
base_model:
- PrimeIntellect/Qwen3-0.6B
---
# Qwen3-0.6B-Reverse-Text-SFT
<!-- Provide a quick summary of what the model is/does. -->
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT`. To be used as warmed up model to RL in `vf-reverse-text`.
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)
```bash
uv run sft @ configs/reverse_text/sft.toml --ckpt
```