YOYO-AI
/

QwQ-Coder-instruct

Text Generation

Model card Files Files and versions

YOYO-AI commited on Feb 12

Commit

56920e4

·

verified ·

1 Parent(s): c77b370

Update README.md

Files changed (1) hide show

README.md +37 -0

README.md CHANGED Viewed

@@ -11,4 +11,41 @@ pipeline_tag: text-generation
 tags:
 - merge
 ---

 tags:
 - merge
 ---
+# QwQ-Coder-instruct
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e174e202fa032de4143324/hHMN168t4-JhJwo0tCM8d.png)
+## Introduction:
+Without compromising the long-chain reasoning capabilities of the **QwQ** model, the integration of **Qwen2.5-Coder-32B-instruct** has significantly enhanced the model's **coding abilities** and **instruction-following skills**.
+Based on my practical tests, the results are exceptionally impressive!
+## merge
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+### Merge Method
+This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [Qwen/Qwen2.5-Coder-32B](https://huggingface.co/Qwen/Qwen2.5-Coder-32B) as a base.
+### Models Merged
+The following models were included in the merge:
+* [Qwen/Qwen2.5-Coder-32B-instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-instruct)
+* [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+merge_method: sce
+models:
+  - model: Qwen/QwQ-32B-Preview
+  - model: Qwen/Qwen2.5-Coder-32B-instruct
+base_model: Qwen/Qwen2.5-Coder-32B
+parameters:
+  select_topk: 1
+dtype: bfloat16
+normalize: true
+```