|
--- |
|
library_name: transformers |
|
license: mit |
|
datasets: |
|
- PrimeIntellect/Reverse-Text-SFT |
|
base_model: |
|
- PrimeIntellect/Qwen3-0.6B |
|
--- |
|
|
|
# Qwen3-0.6B-Reverse-Text-SFT |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
|
|
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT`. To be used as warmed up model to RL in `vf-reverse-text`. |
|
|
|
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`) |
|
|
|
```bash |
|
uv run sft @ configs/reverse_text/sft.toml --ckpt |
|
``` |