Commit 4b1ccd1 by mikasenghaas · verified · 1 parent: bb086bd

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ base_model:

  <!-- Provide a quick summary of what the model is/does. -->

- A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT` for 100 steps (~1.5 epochs) in <1 min. To be used as warmed up model to RL in `vf-reverse-text`.
+ A debug model fine-tuned on 128 token context on `PrimeIntellect/Reverse-Text-SFT`. To be used as warmed up model to RL in `vf-reverse-text`.

  Created with this training command from [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) (commit hash: `ed25704`)