Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ base_model:
|
|
| 11 |
|
| 12 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 13 |
|
| 14 |
-
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT
|
| 15 |
|
| 16 |
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)
|
| 17 |
|
|
|
|
| 11 |
|
| 12 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 13 |
|
| 14 |
+
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT`. To be used as warmed up model to RL in `vf-reverse-text`.
|
| 15 |
|
| 16 |
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)
|
| 17 |
|