-
-
-
-
-
-
Inference Providers
Active filters:
GRPO
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
0.5B
•
Updated
•
101
•
24
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B
•
Updated
•
89
•
1
mlx-community/VisualQuality-R1-7B-4bit
Reinforcement Learning
•
Updated
•
14
•
1
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
•
1B
•
Updated
•
13
•
1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
1B
•
Updated
•
227
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
1B
•
Updated
•
206
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
•
1B
•
Updated
•
6
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
•
1B
•
Updated
•
3
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF
Text Generation
•
1B
•
Updated
•
33
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF
Text Generation
•
1B
•
Updated
•
14
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF
Text Generation
•
1B
•
Updated
•
10
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF
Text Generation
•
1B
•
Updated
•
18
Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF
Text Generation
•
1B
•
Updated
•
9
Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF
Text Generation
•
1B
•
Updated
•
6
Reinforcement Learning
•
Updated
•
1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B
•
Updated
•
131
•
1
alpha-ai/Deep-Reason-SMALL-V0-GGUF
3B
•
Updated
•
37
•
1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
•
3B
•
Updated
•
17
•
2
mradermacher/Deep-Reason-SMALL-V0-GGUF
3B
•
Updated
•
67
•
2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
3B
•
Updated
•
81
•
1
alpha-ai/qwen2.5-reason-thought-lite-GGUF
3B
•
Updated
•
327
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
•
3B
•
Updated
•
3.4k
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
3B
•
Updated
•
53
•
2
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
•
3B
•
Updated
•
28
mradermacher/Cogito-R1-GGUF
33B
•
Updated
•
770
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
•
1B
•
Updated
•
10
•
mradermacher/Cogito-R1-i1-GGUF
33B
•
Updated
•
627
AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
3B
•
Updated
•
24
•
1
Nitral-AI/Captain-Eris_Violet-GRPO-v0.420
Text Generation
•
12B
•
Updated
•
86
•
•
25
prithivMLmods/SmolLM2_135M_Grpo_Gsm8k
Text Generation
•
0.1B
•
Updated
•
31
•
8