inferencerlabs/deepseek-v3.1-MLX-5.5bit

Commit 8b23818 (verified), 1 parent: 49b708a
inferencerlabs committed 2 days ago

Upload complete model

Files changed (1)

README.md +2 -6


@@ -6,10 +6,6 @@ tags:
 - mlx
 pipeline_tag: text-generation
 ---
-## ----CURRENTLY UPLOADING FILES-----
-This notice will be removed once all files have been uploaded.
-## Notes
 **See DeepSeek-V3.1 5.5bit MLX in action - [demonstration video](https://youtu.be/ufXZI6aqOU8)**
 *q5.5bit quant typically achieves 1.141 perplexity in our testing*
@@ -24,8 +20,8 @@ This notice will be removed once all files have been uploaded.
 ## Usage Notes
-* Runs on a single M3 Ultra 512GB RAM
+* Runs on a single M3 Ultra 512GB RAM using [Inferencer app](https://inferencer.com)
 * Memory usage: ~480 GB
 * Expect ~13-19 tokens/s
-* Built with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
+* Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
 * For more details see [demonstration video](https://youtu.be/ufXZI6aqOU8) or visit [DeepSeek-V3.1](https://huggingface.co/deepseek-ai/DeepSeek-V3.1).
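
The ~480 GB memory figure in the usage notes can be sanity-checked with simple arithmetic. The sketch below is not from the repository. It assumes roughly 671 billion total parameters for DeepSeek-V3.1 and treats 5.5 bits as the average storage cost per weight, and `estimate_weight_gb` is a hypothetical helper name.

```python
def estimate_weight_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of quantized weights in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: ~671e9 total parameters (not stated in the README).
ASSUMED_PARAMS = 671e9
# Assumption: 5.5 bits is the average cost per weight, scales included.
ASSUMED_BITS = 5.5

weights_gb = estimate_weight_gb(ASSUMED_PARAMS, ASSUMED_BITS)
print(f"Estimated weights: {weights_gb:.0f} GB")
```

Under those assumptions the weights alone come to roughly 461 GB, which would leave the rest of the reported ~480 GB for the KV cache, activations, and runtime overhead.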