Shaobai Jiang
shaobaij
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
SSRL: Self-Search Reinforcement Learning
upvoted
a
paper
about 2 hours ago
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache
Rematerialization
upvoted
a
paper
about 2 hours ago
Sample More to Think Less: Group Filtered Policy Optimization for
Concise Reasoning
Organizations
None yet