Small Language Models
updated
facebook/opt-iml-max-1.3b
Text Generation
• Updated • 4.94k
• 43
Text Generation
• Updated • 22.5k
• 87
togethercomputer/RedPajama-INCITE-Chat-3B-v1
Text Generation
• Updated • 2.04k
• 152
Text Generation
• 3B • Updated • 399k
• 502
Text Generation
• 3B • Updated • 112k
• 33
cerebras/Cerebras-GPT-2.7B
Text Generation
• Updated • 3.44k
• 46
M4-ai/TinyMistral-6x248M-Instruct
Text Generation
• 1B • Updated • 6
• 11
M4-ai/NeuralReyna-Mini-1.8B-v0.3
Text Generation
• 2B • Updated • 89
• 11
stabilityai/stablelm-2-zephyr-1_6b
Text Generation
• 2B • Updated • 6.24k
• 186
stabilityai/stable-code-instruct-3b
Text Generation
• 3B • Updated • 1.65k
• 182
stabilityai/stablelm-zephyr-3b
Text Generation
• 3B • Updated • 17.7k
• 259
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation
• 1B • Updated • 2.97M
• 1.56k
Text Generation
• 1B • Updated • 27.7k
• 28
Text Generation
• Updated • 11.2k
• 125
Text Generation
• 4B • Updated • 15k
• 45
Text Generation
• 2B • Updated • 2.94M
• • 161
Qwen/Qwen2.5-Coder-1.5B-Instruct
Text Generation
• 2B • Updated • 571k
• • 112
Text Generation
• 3B • Updated • 8.68M
• 437
Text Generation
• 3B • Updated • 57.3k
• 867
Text Generation
• 3B • Updated • 80k
• 173
Text Generation
• 3B • Updated • 11.7k
• 97
Text Generation
• Updated • 174
• 24
Text Generation
• 1B • Updated • 6.13k
• 219
Text Generation
• 3B • Updated • 360k
• • 1.33k
Text Generation
• 1B • Updated • 74.8k
• 1.36k
Text Generation
• 3B • Updated • 1.34M
• 3.44k
ministral/Ministral-3b-instruct
Text Generation
• 3B • Updated • 2.17k
• 83
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
• 2B • Updated • 13.9k
• 118
h2oai/h2o-danube-1.8b-chat
Text Generation
• 2B • Updated • 154
• 55
h2oai/h2o-danube2-1.8b-chat
Text Generation
• 2B • Updated • 287
• 62
h2oai/h2o-danube3-4b-chat
Text Generation
• 4B • Updated • 1.1k
• 68
h2oai/h2o-danube3.1-4b-chat
Text Generation
• 4B • Updated • 249
• 5
Text Generation
• 1B • Updated • 174
• 42
Text Generation
• 6B • Updated • 25k
• 70
Text Generation
• 6B • Updated • 5.88k
• 41
Updated • 211
• 258
Updated • 151k
• 1.16k
Text Generation
• 4B • Updated • 842
• 12
meta-llama/Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated • 5.76M
• 2.09k
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 7.13k
• 175
ibm-granite/granite-3b-code-instruct-2k
Text Generation
• Updated • 2.44k
• 39
ibm-granite/granite-3.0-2b-instruct
Text Generation
• 3B • Updated • 4.96k
• 49
HuggingFaceTB/SmolLM2-1.7B
Text Generation
• 2B • Updated • 71.1k
• 147
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 751k
• • 1.47k
apple/OpenELM-3B-Instruct
Text Generation
• 3B • Updated • 3.18k
• 339
internlm/internlm2-chat-1_8b
Text Generation
• 2B • Updated • 5.07k
• 35
internlm/internlm2_5-1_8b-chat
Text Generation
• 2B • Updated • 1.49k
• 25
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
• 2B • Updated • 14k
• 577
microsoft/Phi-3-mini-128k-instruct
Text Generation
• Updated • 247k
• 1.7k
microsoft/Phi-4-mini-instruct
Text Generation
• Updated • 1.01M
• 712
Text Generation
• 1.0B • Updated • 890k
• 909
Text Generation
• Updated • 163
• 122
ibm-granite/granite-3.3-2b-instruct
Text Generation
• Updated • 29.4k
• 84
Text Generation
• 4B • Updated • 4.49k
• • 503
Text Generation
• 3B • Updated • 1.1M
• 930
Qwen/Qwen3-4B-Instruct-2507
Text Generation
• 4B • Updated • 7.36M
• • 802
Qwen/Qwen3-4B-Thinking-2507
Text Generation
• 4B • Updated • 1.68M
• • 576
Text Generation
• 3B • Updated • 6.58k
• 184
ibm-granite/granite-4.0-h-micro
Text Generation
• 3B • Updated • 46.8k
• 142
Alibaba-Apsara/DASD-4B-Thinking
Text Generation
• Updated • 396
• 217
mistralai/Ministral-3-3B-Reasoning-2512
4B • Updated • 13.5k
• 111
mistralai/Ministral-3-3B-Instruct-2512
Updated • 132k
• 216
Text Generation
• 2B • Updated • 933
• 227
Nanbeige/Nanbeige4-3B-Thinking-2511
Text Generation
• 4B • Updated • 1.88k
• 204
Text Generation
• 4B • Updated • 417k
• • 1.03k
LiquidAI/LFM2.5-1.2B-Instruct
Text Generation
• 1B • Updated • 325k
• 555
LiquidAI/LFM2.5-1.2B-Thinking
Text Generation
• 1B • Updated • 27.7k
• 327
Text Generation
• 4B • Updated • 518
• 75
janhq/Jan-v3-4B-base-instruct
Text Generation
• 4B • Updated • 599
• 60
CohereLabs/tiny-aya-global
Text Generation
• 3B • Updated • 42.4k
• • 147
Image-Text-to-Text
• 5B • Updated • 2.91M
• 448