Papers
Verifiable Environments Are LEGO Bricks: Recursive Composition for Reasoning Generalization
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling
-
QwenScope
🔥 37 • Explore and steer Qwen3 model features with interactive heatmaps
-
Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models
Paper • 2605.11887 • Published • 15
Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_50
Updated • 184 • 38
Qwen/SAE-Res-Qwen3.5-2B-Base-W32K-L0_50
Updated • 85 • 12
-
Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text • 403B • Updated • 620k • 1.51k
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text • 403B • Updated • 609k • 177
Qwen/Qwen3.5-122B-A10B
Image-Text-to-Text • 125B • Updated • 656k • 569
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated • 759k • 105
-
Qwen/Qwen3-ASR-1.7B
Automatic Speech Recognition • 2B • Updated • 1.14M • 878
Qwen/Qwen3-ASR-0.6B
Automatic Speech Recognition • 0.9B • Updated • 601k • 300
Qwen/Qwen3-ForcedAligner-0.6B
Automatic Speech Recognition • 0.9B • Updated • 307k • 142
Qwen3-ASR Demo
🎙 140 • Transcribe audio to text with timestamps and visualization
-
Qwen3 VL Demo
😻 439 • Chat with AI using text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 4.74k • 398
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 1.18M • 393
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 5.25k • 29
-
Qwen3 Omni Demo
⚡ 265 • Chat with AI using text, audio, images, or video
-
Qwen3 Omni Captioner Demo
🐠 63 • Generate a caption for any uploaded or recorded audio
-
Qwen/Qwen3-Omni-30B-A3B-Captioner
Any-to-Any • 32B • Updated • 4.36k • 228
Qwen/Qwen3-Omni-30B-A3B-Instruct
Any-to-Any • 35B • Updated • 1.27M • 938
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 26.9k • 85
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 41.4k • 406
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 161k • 147
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 104k • 784
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 249k • 1.03k
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • 81B • Updated • 21.7k • 489
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • 81B • Updated • 137k • 90
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • 81B • Updated • 2.68k • 55
-
Qwen3 Coder WebDev
🌍 1.08k • Generate web app HTML/React code from a text description
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 19k • 1.34k
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • 480B • Updated • 102k • 154
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • 31B • Updated • 1.44M • 1.11k
-
Qwen2.5 VL 32B Instruct Demo
🏃 167 • Chat with a multimodal AI using text, images, or video
-
Qwen2.5-VL Technical Report
Paper • 2502.13923 • Published • 219
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text • 33B • Updated • 303k • 491
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text • 73B • Updated • 309k • 630
-
Qwen2.5 Coder Artifacts
🐢 1.73k • Generate and preview app code from a text description
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.16M • 2.04k
Qwen/Qwen2.5-Coder-32B
Text Generation • 33B • Updated • 2.75k • 156
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 157