useful sharded checkpoints for users to run inference / fine-tuning on a Google colab without having to deal with CPU OOM issues.
Younes B
ybelkada
AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
new activity
5 days ago
tiiuae/dense-3b-arch2:add config.json for iter_50000
upvoted
an
article
9 days ago
Falcon-Arabic: A Breakthrough in Arabic Language Models
new activity
17 days ago
tiiuae/dense-500m-arch1:iter_0044000 missing weights