Update README.md
Browse files
README.md
CHANGED
|
@@ -6,13 +6,13 @@ library_name: transformers
|
|
| 6 |
tags:
|
| 7 |
- mergekit
|
| 8 |
- merge
|
| 9 |
-
|
| 10 |
---
|
| 11 |
# merge
|
| 12 |
|
| 13 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
| 14 |
|
| 15 |
-
|
| 16 |
### Merge Method
|
| 17 |
|
| 18 |
This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
|
|
@@ -36,4 +36,4 @@ base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
|
|
| 36 |
dtype: bfloat16
|
| 37 |
parameters:
|
| 38 |
t: [0, 0.5, 0.7, 1, 0.7, 0.5, 0]
|
| 39 |
-
```
|
|
|
|
| 6 |
tags:
|
| 7 |
- mergekit
|
| 8 |
- merge
|
| 9 |
+
license: apache-2.0
|
| 10 |
---
|
| 11 |
# merge
|
| 12 |
|
| 13 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
| 14 |
|
| 15 |
+
|
| 16 |
### Merge Method
|
| 17 |
|
| 18 |
This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
|
|
|
|
| 36 |
dtype: bfloat16
|
| 37 |
parameters:
|
| 38 |
t: [0, 0.5, 0.7, 1, 0.7, 0.5, 0]
|
| 39 |
+
```
|