mlx-community/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview-Q8

8-bit precision


bobig committed on Feb 21

Commit 9bebb31 · verified · 1 Parent(s): 80aab64

Update README.md

Files changed (1)

README.md +1 -0


@@ -60,6 +60,7 @@ response = generate(model, tokenizer, prompt=prompt, verbose=True)
 Are you still reading down here?
 Maybe check out this new Q4 lossless quant compression from NexaAI and tell the MLX community how to improve mlx-lm to get 8-bit quality at 4-bit speed!
 [DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant](https://huggingface.co/NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant)
 ore
