Inference Providers
Active filters: rl
bigatuna/Qwen3.5-9b-Sushi-Coder-RL-GGUF
Text Generation
• 9B • Updated • 4.31k
• 25
bigatuna/Qwen3.5-9b-Sushi-Coder-RL-MLX
Text Generation
• 2B • Updated • 624
• 3
pankajmathur/RenCoder-Devstral-Small-2507
Text Generation
• 24B • Updated • 138
• 1
Indelwin/Qwen3-30B-A3B-ToolAgent-GRPO
Text Generation
• Updated • 42
• 2
Reinforcement Learning
• Updated d-byrne/snake-v1_training_state
Updated
InstaDeepAI/jumanji-benchmark-a2c-BinPack-v2
Updated
InstaDeepAI/jumanji-benchmark-a2c-CVRP-v1
ContextualAI/archangel_sft_pythia1-4b
Text Generation
• 1B • Updated • 6
ContextualAI/archangel_sft_pythia2-8b
Text Generation
• 3B • Updated • 5
• 1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
• 7B • Updated • 5
ContextualAI/archangel_sft_pythia12-0b
Text Generation
• 12B • Updated • 5
ContextualAI/archangel_sft_llama7b
Text Generation
• 7B • Updated • 10
• 1
ContextualAI/archangel_sft_llama13b
Text Generation
• 13B • Updated • 10
ContextualAI/archangel_sft_llama30b
Text Generation
• 33B • Updated • 5
ContextualAI/archangel_slic_llama30b
Text Generation
• 33B • Updated • 4
ContextualAI/archangel_slic_pythia1-4b
Text Generation
• 1B • Updated • 4
ContextualAI/archangel_slic_pythia2-8b
Text Generation
• 3B • Updated • 3
ContextualAI/archangel_slic_pythia6-9b
Text Generation
• 7B • Updated • 9
ContextualAI/archangel_slic_pythia12-0b
Text Generation
• 12B • Updated • 8
ContextualAI/archangel_slic_llama7b
Text Generation
• 7B • Updated • 15
• 1
ContextualAI/archangel_slic_llama13b
Text Generation
• 13B • Updated • 4
ContextualAI/archangel_dpo_pythia1-4b
Text Generation
• 1B • Updated • 2
ContextualAI/archangel_dpo_pythia2-8b
Text Generation
• 3B • Updated • 7
ContextualAI/archangel_dpo_pythia6-9b
Text Generation
• 7B • Updated • 4
ContextualAI/archangel_dpo_pythia12-0b
Text Generation
• 12B • Updated • 5
ContextualAI/archangel_dpo_llama7b
Text Generation
• 7B • Updated • 5
ContextualAI/archangel_dpo_llama13b
Text Generation
• 13B • Updated • 11
ContextualAI/archangel_dpo_llama30b
Text Generation
• 33B • Updated • 4
ContextualAI/archangel_kto_pythia1-4b
Text Generation
• 1B • Updated