Active filters: nvfp4
vincentzed-hf/Qwen3.5-397B-A17B-NVFP4 • Image-Text-to-Text • 17.8k • 9
GadflyII/Qwen3-Coder-Next-NVFP4 • Text Generation • 216k • 21
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4 • Text Generation • 76.5k • 49
tacos4me/Step-3.5-Flash-NVFP4 • Text Generation • 111B • 1.49k • 5
mlx-community/Qwen3.5-397B-A17B-nvfp4 • Text Generation • 396B • 3.51k • 4
GadflyII/GLM-4.6V-NVFP4 • Image-Text-to-Text • 62B • 19.3k • 10
vincentzed-hf/Qwen3-Coder-Next-NVFP4 • Text Generation • 5.52k • 6
Sehyo/Qwen3.5-397B-A17B-NVFP4 • 1.45k • 3
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4 • Text Generation • 29.9k • 27
GadflyII/GLM-4.7-Flash-MTP-NVFP4 • Text Generation • 19B • 4.64k • 2
nvidia/DeepSeek-V3.1-NVFP4 • Text Generation • 394B • 37.7k • 13
nvidia/DeepSeek-V3.2-NVFP4 • Text Generation • 394B • 14.8k • 6
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4 • Text Generation • 635 • 4
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4 • Text Generation • 120B • 1.91k • 3
GadflyII/GLM-4.7-Flash-NVFP4 • Text Generation • 289k • 63
glux-cz/Qwen3-8B-NVFP4-Blackwell • Text Generation • 76 • 1
lopi999/wan22_i2v_nvfp4
JEILDLWLRMA/Qwen3-VL-4B-Instruct-NVFP4 • Image-to-Text • 3B • 40 • 2
vistralis/Qwen3-4B-NVFP4 • Text Generation • 3B • 94 • 1
apolloparty/Qwen3-4B-NVFP4A16 • 2B • 6
cortecs/Qwen3-8B-NVFP4A16 • 5B • 2
cortecs/Qwen3-8B-NVFP4 • 5B • 4
cortecs/Qwen3-8B-clean-sparse • 6B • 2
cortecs/Qwen3-8B-clean-sparse-nvfp4a16 • 5B
cortecs/Qwen3-8B-clean-sparse-finetuned-0.01-nvfp4a16 • 5B • 1
cortecs/Qwen3-8B-clean-sparse-finetuned-0.1-nvfp4a16 • 5B • 1
llmat/Mistral-Small-24B-Instruct-2501-NVFP4 • Text Generation • 14B • 93
llmat/Qwen3-30B-A3B-Instruct-2507-NVFP4 • Text Generation • 17B • 596 • 2
llmat/Qwen3-4B-Instruct-2507-NVFP4 • Text Generation • 3B • 355 • 1