Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,6 @@ tags:
|
|
| 11 |
|
| 12 |
Exllamav3 quantization of [Qwen/Qwen3-Coder-480B-A35B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct)
|
| 13 |
|
| 14 |
-
[2.00 bpw h6](https://huggingface.co/MikeRoz/Qwen3-Coder-480B-A35B-Instruct-exl3/tree/2.00bpw_H6)
|
| 15 |
|
| 16 |
The 2.00bpw quant will fit in six 24 GB cards with 40k of fp16 context.
|
|
|
|
| 11 |
|
| 12 |
Exllamav3 quantization of [Qwen/Qwen3-Coder-480B-A35B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct)
|
| 13 |
|
| 14 |
+
[2.00 bpw h6](https://huggingface.co/MikeRoz/Qwen3-Coder-480B-A35B-Instruct-exl3/tree/2.00bpw_H6) 114.396 GiB
|
| 15 |
|
| 16 |
The 2.00bpw quant will fit in six 24 GB cards with 40k of fp16 context.
|