Models (184)

Active filters: nvfp4

drbaph/FireRed-Image-Edit-1.0_ComfyUI_Quants

Image-Text-to-Image • Updated 1 day ago • 10

vincentzed-hf/Qwen3.5-397B-A17B-NVFP4

Image-Text-to-Text • Updated 5 days ago • 17.8k • 9

GadflyII/Qwen3-Coder-Next-NVFP4

Text Generation • Updated 19 days ago • 216k • 21

nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4

Text Generation • Updated 14 days ago • 76.5k • 49

tacos4me/Step-3.5-Flash-NVFP4

Text Generation • 111B • Updated 1 day ago • 1.49k • 5

mlx-community/Qwen3.5-397B-A17B-nvfp4

Text Generation • 396B • Updated 6 days ago • 3.51k • 4

GadflyII/GLM-4.6V-NVFP4

Image-Text-to-Text • 62B • Updated Jan 12 • 19.3k • 10

vincentzed-hf/Qwen3-Coder-Next-NVFP4

Text Generation • Updated 7 days ago • 5.52k • 6

Sehyo/Qwen3.5-397B-A17B-NVFP4

Updated about 13 hours ago • 1.45k • 3

nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4

Text Generation • Updated 14 days ago • 29.9k • 27

GadflyII/GLM-4.7-Flash-MTP-NVFP4

Text Generation • 19B • Updated 21 days ago • 4.64k • 2

nvidia/DeepSeek-V3.1-NVFP4

Text Generation • 394B • Updated Jan 13 • 37.7k • 13

nvidia/DeepSeek-V3.2-NVFP4

Text Generation • 394B • Updated Jan 21 • 14.8k • 6

nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4

Text Generation • Updated 23 days ago • 635 • 4

nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4

Text Generation • 120B • Updated 23 days ago • 1.91k • 3

GadflyII/GLM-4.7-Flash-NVFP4

Text Generation • Updated Jan 20 • 289k • 63

glux-cz/Qwen3-8B-NVFP4-Blackwell

Text Generation • Updated Jan 22 • 76 • 1

lopi999/wan22_i2v_nvfp4

Image-to-Video • Updated 26 days ago • 114 • 1

JEILDLWLRMA/Qwen3-VL-4B-Instruct-NVFP4

Image-to-Text • 3B • Updated 16 days ago • 40 • 2

vistralis/Qwen3-4B-NVFP4

Text Generation • 3B • Updated 15 days ago • 94 • 1

apolloparty/Qwen3-4B-NVFP4A16

2B • Updated Jul 12, 2025 • 6

cortecs/Qwen3-8B-NVFP4A16

5B • Updated Nov 27, 2025 • 2

cortecs/Qwen3-8B-NVFP4

5B • Updated Nov 27, 2025 • 4

cortecs/Qwen3-8B-clean-sparse

6B • Updated Nov 27, 2025 • 2

cortecs/Qwen3-8B-clean-sparse-nvfp4a16

5B • Updated Nov 27, 2025

cortecs/Qwen3-8B-clean-sparse-finetuned-0.01-nvfp4a16

5B • Updated Nov 27, 2025 • 1

cortecs/Qwen3-8B-clean-sparse-finetuned-0.1-nvfp4a16

5B • Updated Nov 27, 2025 • 1

llmat/Mistral-Small-24B-Instruct-2501-NVFP4

Text Generation • 14B • Updated Aug 27, 2025 • 93

llmat/Qwen3-30B-A3B-Instruct-2507-NVFP4

Text Generation • 17B • Updated Aug 27, 2025 • 596 • 2

llmat/Qwen3-4B-Instruct-2507-NVFP4

Text Generation • 3B • Updated Aug 27, 2025 • 355 • 1