amd
's Collections
Quark Quantized OCP FP8 Models
updated
amd/Llama-3.1-8B-Instruct-FP8-KV
8B
•
Updated
•
24.7k
•
6
amd/Llama-3.1-70B-Instruct-FP8-KV
71B
•
Updated
•
35.4k
•
4
amd/Llama-3.1-405B-Instruct-FP8-KV
406B
•
Updated
•
3.6k
•
5
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV
3B
•
Updated
•
1.49k
•
3
amd/dbrx-instruct-FP8-KV
132B
•
Updated
•
1.01k
amd/deepseek-moe-16b-chat-FP8-KV
16B
•
Updated
•
14
amd/grok-1-FP8-KV
316B
•
Updated
•
80
•
1
amd/Mixtral-8x22B-Instruct-v0.1-FP8-KV
141B
•
Updated
•
779
•
3
amd/c4ai-command-r-plus-FP8-KV
104B
•
Updated
•
1.03k
amd/Llama-3.2-1B-FP8-KV
1B
•
Updated
•
15
amd/Llama-3.2-1B-Instruct-FP8-KV
1B
•
Updated
•
8.06k
amd/Llama-3.2-3B-FP8-KV
3B
•
Updated
•
18
amd/Llama-3.2-3B-Instruct-FP8-KV
3B
•
Updated
•
26
amd/Llama-3.2-11B-Vision-Instruct-FP8-KV
11B
•
Updated
•
12
•
1
amd/Llama-3.2-90B-Vision-Instruct-FP8-KV
amd/jais-13b-chat-FP8
13B
•
Updated
•
4
amd/dbrx-base-FP8-KV
132B
•
Updated
•
6
amd/Mistral-7B-v0.1-FP8-KV
7B
•
Updated
•
134
amd/Llama-3.3-70B-Instruct-FP8-KV
71B
•
Updated
•
4.25k
•
3
amd/grok-1-W4A8KV8
Updated
•
23
•
1
amd/Llama-2-70b-chat-hf_FP8_MLPerf_V2
69B
•
Updated
•
99