Shaobai Jiang
shaobaij
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference
Optimization
upvoted
a
paper
about 2 hours ago
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning
upvoted
a
paper
about 2 hours ago
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid
Mamba-Transformer Reasoning Model
Organizations
None yet