Model Card for Model ID

This is a quantized version of Llama 3.1 8B, produced with bitsandbytes. More quantized LLMs are coming soon.
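A minimal sketch of how a pre-quantized bitsandbytes checkpoint like this one is typically loaded with the `transformers` library. The card does not give the repository ID, so `REPO_ID` below is a placeholder, and the helper names `load_quantized` and `generate` are hypothetical. It assumes `transformers`, `accelerate`, and `bitsandbytes` are installed and that a CUDA GPU is available.

```python
# Placeholder: the card does not state the actual repository ID.
REPO_ID = "your-namespace/your-model-id"


def load_quantized(repo_id):
    """Load the tokenizer and the pre-quantized model from the Hub.

    The quantization settings are expected to be stored in the checkpoint's
    config, so no explicit quantization config is passed here (an assumption
    about how this checkpoint was saved).
    """
    # Imported lazily so this module can be imported without the heavy deps.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # device_map="auto" lets accelerate place the weights on available devices.
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return tokenizer, model


def generate(repo_id, prompt, max_new_tokens=64):
    """Generate a completion for `prompt` with the quantized model."""
    tokenizer, model = load_quantized(repo_id)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate(REPO_ID, "Hello")` with the real repository ID substituted in would download the weights on first use and return the decoded text.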

Model Description

Model Source

Model size: 4.65B params (Safetensors)
Tensor types: BF16, F32, U8
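The listed size is smaller than the 8B of the base model, and one plausible reason is how quantized weights are stored. The sketch below is a back-of-the-envelope count, not a statement of how this checkpoint was built. It uses the published Llama 3.1 8B architecture dimensions and assumes, since the card does not say, that the linear layers are stored as 4-bit values packed two per U8 element, with one F32 scale per block of 64 weights, while the embeddings, output head, and norms stay unquantized. The function name `count_elements` is illustrative.

```python
# Published Llama 3.1 8B architecture dimensions.
HIDDEN = 4096
INTERMEDIATE = 14336
LAYERS = 32
VOCAB = 128256
KV_DIM = 1024  # 8 key/value heads x 128 dims (grouped-query attention)

# Assumption (not stated on the card): 4-bit blockwise quantization with
# one F32 scale per 64 weights and no double quantization.
BLOCK_SIZE = 64


def count_elements():
    """Return (original parameter count, stored tensor element count)."""
    # q_proj and o_proj are HIDDEN x HIDDEN; k_proj and v_proj are HIDDEN x KV_DIM.
    attn = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM
    # gate_proj, up_proj, and down_proj.
    mlp = 3 * HIDDEN * INTERMEDIATE
    linear = LAYERS * (attn + mlp)

    # Input embedding plus untied output head, assumed left unquantized.
    embeddings = 2 * VOCAB * HIDDEN
    # Two norms per layer plus the final norm.
    norms = LAYERS * 2 * HIDDEN + HIDDEN

    original = linear + embeddings + norms
    packed = linear // 2            # two 4-bit weights per U8 element
    scales = linear // BLOCK_SIZE   # one F32 scale per block
    stored = packed + scales + embeddings + norms
    return original, stored


original, stored = count_elements()
print(f"original: {original / 1e9:.2f}B, stored: {stored / 1e9:.2f}B")
```

Under these assumptions the stored element count comes to about 4.65B, which is consistent with the listed model size and with the mix of BF16, F32, and U8 tensor types.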