- Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
  Paper • 2602.05885 • Published • 28
- hkust-nlp/drkernel-14b
  Text Generation • 15B • Updated • 51 • 6
- hkust-nlp/drkernel-8b
  Text Generation • 8B • Updated • 119 • 4
- hkust-nlp/drkernel-14b-coldstart
  Text Generation • 0.5B • Updated • 1.14k
HKUST NLP Group
university
Papers
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth
Models 66
hkust-nlp/drkernel-8b-coldstart
Text Generation • 0.3B • Updated • 4.5k
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 1.14k
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 51 • 6
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 119 • 4
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated • 215 • 12
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated • 2
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 3
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 2
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 6 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 3 • 1
Datasets 32
hkust-nlp/drkernel-validation-data
Viewer • Updated • 100 • 73 • 1
hkust-nlp/drkernel-rl-data
Viewer • Updated • 72k • 48
hkust-nlp/drkernel-coldstart-8k
Viewer • Updated • 8.92k • 77 • 2
hkust-nlp/Toolathlon-Trajectories
Preview • Updated • 1.55k • 19
hkust-nlp/WebExplorer-QA
Viewer • Updated • 100 • 14 • 6
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated • 26 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview • Updated • 101 • 56
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer • Updated • 6.12k • 153 • 1
hkust-nlp/deepscaler_simplelr
Viewer • Updated • 40.3k • 12
hkust-nlp/Laser-Deepscaler-Dataset
Viewer • Updated • 40.8k • 87