"Not all quantized models perform well." Serving frameworks: Ollama on NVIDIA GPU, llama.cpp on CPU with AVX & AMX
v1k
xbruce22
AI & ML interests
None yet
Recent Activity
new activity
about 18 hours ago
xai-org/grok-2: llama.ccp support & GGUFs
new activity
about 19 hours ago
xai-org/grok-2: Inspired from LICENSE I did this 🤯
liked
a model
2 days ago
unsloth/DeepSeek-V3.1-GGUF
Organizations
None yet