Active filters: open-r1
wzx111/Qwen3-1.7B-MATH-GDPO
Text Generation
• 2B • Updated • 44
• 2
smolagents/SmolVLM2-2.2B-Instruct-Agentic-GUI
Image-Text-to-Text
• 2B • Updated • 78
• 65
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 19
yucaiwen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 8
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
• 8B • Updated • 23
• 1
JinnP/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 7
bangan/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 6
liusq19/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 8
stepyoun/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 7
howey/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 5
wxnfifth/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 20
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated • 9
Text Generation
• 8B • Updated • 2
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated • 6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated • 15
skzxjus/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 9
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B • Updated • 104
skzxjus/Qwen2.5-7B-1m-Open-R1-Distill
Text Generation
• 8B • Updated • 5
• 4
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
• 8B • Updated • 7
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 7
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
8B • Updated • 342
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
• 8B • Updated • 4
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
2B • Updated • 53
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
8B • Updated • 265
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
• 8B • Updated • 6
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
• 8B • Updated • 7
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
• 2B • Updated • 6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
• 2B • Updated • 4
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 13
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
• 8B • Updated • 3
• 6