Update README.md
Browse files
README.md
CHANGED
|
@@ -3,9 +3,8 @@ base_model:
|
|
| 3 |
- mistralai/Mistral-Small-3.2-24B-Instruct-2506
|
| 4 |
---
|
| 5 |
|
| 6 |
-
|
| 7 |
-
- No vision encoder
|
| 8 |
-
- Reused some special tokens for ChatML tokens
|
| 9 |
-
- Standard "Mistral" architecture
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
|
|
| 3 |
- mistralai/Mistral-Small-3.2-24B-Instruct-2506
|
| 4 |
---
|
| 5 |
|
| 6 |
+
Trained from [anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML](https://huggingface.co/anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML) for convenience (no Pixtral compatibility needed). No vision adapter currently.
|
|
|
|
|
|
|
|
|
|
| 7 |
|
| 8 |
+
Test model trained at 16k context on 50M tokens of long-form human writing (mostly books).
|
| 9 |
+
|
| 10 |
+
Haven't tested yet but regular Tekken v7 instruct will work and samplers are probably the same as you'd use for 3.2 Instruct.
|