This collection contains Quark quantized and OGA exported LLM models for execution on CPU
-
amd/Phi-3-mini-4k-instruct_int4_float16_onnx_cpu
Updated -
amd/Qwen1.5-7B-Chat_uint4_asym_g128_float16_onnx_cpu
Updated -
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu
Text Generation • Updated -
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid
Updated • 291 • 1