Active filters: sglang
Jackrong/Qwopus3.5-27B-v3-FP8-vllm-ready
Text Generation
• 27B • Updated • 1.14k
• 9
Alexzander85/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-NVFP4-MLP-FP8KV
Text Generation
• 8B • Updated • 1.38k
• 9
thoughtworks/Gemma-4-31B-Eagle3
Text Generation
• 0.6B • Updated • 439
• 2
Doradus-AI/Hermes-4.3-36B-FP8
Text Generation
• 36B • Updated • 157
• 3
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 135k
• 24
Image-Text-to-Text
• 3B • Updated • 2.05k
• 3
AxionML/Qwen3.5-35B-A3B-NVFP4
Image-Text-to-Text
• Updated • 10.7k
• 4
apothic/Qwen3.5-9B-ultra-heretic-fp8
Image-Text-to-Text
• 9B • Updated • 634
• 4
Text-to-Speech
• Updated • 2.25k
• 17
KyleHessling1/Qwopus3.5-27B-v3-FP8-vllm-ready
Text Generation
• 27B • Updated • 1.79k
• 1
Image-Text-to-Text
• 7B • Updated • 41.8k
• 11
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
• 8B • Updated • 26
• 9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text
• 7B • Updated • 26
• 1
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 92
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 87
alvarobartt/grok-2-tokenizer
Text Generation
• Updated • 38
• 4
173B • Updated • 1.92k
• 35
mradermacher/MiniMax-M2-THRIFT-GGUF
JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit
Text Generation
• 49B • Updated • 12
• 1
mradermacher/MiniMax-M2-THRIFT-i1-GGUF
173B • Updated • 155
• 10
bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF
Text Generation
• 173B • Updated • 436
• 8
osmapi/MiniMax-M2-THRIFT-55
106B • Updated • 201
• 5
JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct
Text Generation
• 0.2B • Updated • 63
• 1
mradermacher/MiniMax-M2-THRIFT-55-GGUF
106B • Updated • 68
• 2
mradermacher/MiniMax-M2-THRIFT-55-i1-GGUF
106B • Updated • 464
• 2
osmapi/MiniMax-M2-THRIFT-55-MLX-4bit
106B • Updated • 88
• 2
osmapi/MiniMax-M2-THRIFT-55-MLX-6bit
106B • Updated • 35
Doradus-AI/MiroThinker-v1.0-30B-FP8
Text Generation
• 31B • Updated • 91
• 4
Doradus-AI/RnJ-1-Instruct-FP8
Text Generation
• 9B • Updated • 4
• 4