inferencerlabs
/

openai-gpt-oss-20b-MLX-6.5bit

Text Generation

Model card Files Files and versions

inferencerlabs commited on 9 days ago

Commit

5e7d07d

·

verified ·

1 Parent(s): 4393170

Upload complete model

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -20,7 +20,8 @@ base_model: openai/gpt-oss-20b
 ## Usage Notes
-* Built with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
 * Memory usage: ~17 GB (down from ~46GB required by native MXFP4 format)
 * Expect ~100 tokens/s
 * For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b).

 ## Usage Notes
+* Tested to run with [Inferencer app](https://inferencer.com)
 * Memory usage: ~17 GB (down from ~46GB required by native MXFP4 format)
 * Expect ~100 tokens/s
+* Quantized with a modified version of [MLX](https://github.com/ml-explore/mlx) 0.26
 * For more details see [demonstration video](https://youtu.be/mlpFG8e_fLw) or visit [OpenAI gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b).