Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ base_model:
|
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
|
14 |
-
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT
|
15 |
|
16 |
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)
|
17 |
|
|
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
|
14 |
+
A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT`. To be used as warmed up model to RL in `vf-reverse-text`.
|
15 |
|
16 |
Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)
|
17 |
|