Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,6 @@ pipeline_tag: text-generation
|
|
13 |
|
14 |

|
15 |
|
16 |
-
---
|
17 |
|
18 |
# Luth-1.7B-Instruct
|
19 |
|
@@ -21,6 +20,8 @@ pipeline_tag: text-generation
|
|
21 |
|
22 |
Our Evaluation, training and data scripts are available on [GitHub](https://github.com/kurakurai/Luth), along with the [Blog](https://huggingface.co/blog/MaxLSB/luth) we wrote.
|
23 |
|
|
|
|
|
24 |
## Model Details
|
25 |
|
26 |
Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-1.7B model. This process successfully retained the model's English capabilities while improving its performance on most selected benchmarks in both French and English.
|
|
|
13 |
|
14 |

|
15 |
|
|
|
16 |
|
17 |
# Luth-1.7B-Instruct
|
18 |
|
|
|
20 |
|
21 |
Our Evaluation, training and data scripts are available on [GitHub](https://github.com/kurakurai/Luth), along with the [Blog](https://huggingface.co/blog/MaxLSB/luth) we wrote.
|
22 |
|
23 |
+

|
24 |
+
|
25 |
## Model Details
|
26 |
|
27 |
Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-1.7B model. This process successfully retained the model's English capabilities while improving its performance on most selected benchmarks in both French and English.
|