cgus/DeepCoder-14B-Preview-exl2

Text Generation

4-bit precision


cgus committed on Apr 10

Commit be1aeaf · verified · 1 Parent(s): 7e71351

Update README.md

Files changed (1)

README.md +19 -4

@@ -1,6 +1,5 @@
 ---
-license: mit
-library_name: transformers
+library_name: exllamav2
 datasets:
 - PrimeIntellect/verifiable-coding-problems
 - likaixin/TACO-verified
@@ -8,10 +7,26 @@ datasets:
 language:
 - en
 base_model:
-- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
+- agentica-org/DeepCoder-14B-Preview
 pipeline_tag: text-generation
 ---
+# DeepCoder-14B-Preview-exl2
+Original model: [DeepCoder-14B-Preview](https://huggingface.co/agentica-org/DeepCoder-14B-Preview) by [Agentica](https://huggingface.co/agentica-org)
+Based on: [DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B) by [DeepSeek](https://huggingface.co/deepseek-ai)
+Foundation model: [Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B) by [Qwen](https://huggingface.co/Qwen)
+## Quants
+[4bpw h6 (main)](https://huggingface.co/cgus/DeepCoder-14B-Preview-exl2/tree/main)
+[4.5bpw h6](https://huggingface.co/cgus/DeepCoder-14B-Preview-exl2/tree/4.5bpw-h6)
+[5bpw h6](https://huggingface.co/cgus/DeepCoder-14B-Preview-exl2/tree/5bpw-h6)
+[6bpw h6](https://huggingface.co/cgus/DeepCoder-14B-Preview-exl2/tree/6bpw-h6)
+[8bpw h8](https://huggingface.co/cgus/DeepCoder-14B-Preview-exl2/tree/8bpw-h8)
+## Quantization notes
+Made with Exllamav2 0.2.8 with default dataset.
+It can be used with TabbyAPI, Text-Generation-WebUI and requires RTX GPU on Windows or RTX/ROCm on Linux.
+RAM offloading isn't supported natively, so make sure it fits your GPU VRAM.
+I'd recommend at least a 12GB GPU for 4-5bpw quants.
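The 12GB recommendation above can be sanity-checked with a rough estimate of weight size from bits per weight (bpw). This is only a sketch: the parameter count of about 14.7 billion is an assumption based on the Qwen2.5-14B foundation model, the helper name `weight_size_gib` is hypothetical, and the result ignores the separately quantized head layer (h6/h8), the KV cache and activation memory, all of which add to actual VRAM use.

```python
# Rough estimate of EXL2 weight size from bits per weight (bpw).
# ASSUMPTION: ~14.7e9 parameters (approximate figure for Qwen2.5-14B).
# Ignores head-layer precision (h6/h8), KV cache and activations.

ASSUMED_PARAMS = 14.7e9  # hypothetical default, adjust if you know the exact count


def weight_size_gib(bpw: float, n_params: float = ASSUMED_PARAMS) -> float:
    """Return the approximate size of the quantized weights in GiB."""
    total_bits = n_params * bpw
    total_bytes = total_bits / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    for bpw in (4.0, 4.5, 5.0, 6.0, 8.0):
        print(f"{bpw} bpw: about {weight_size_gib(bpw):.1f} GiB of weights")
```

Whatever remains after the weights on a given card is what is left for context (KV cache), so a longer context needs either a lower bpw quant or more VRAM.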
+# Original model card
 <div align="center">
 <span style="font-family: default; font-size: 1.5em;">DeepCoder-14B-Preview</span>
 <div>