- Group Sequence Policy Optimization
  Paper • 2507.18071 • Published • 290
- LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
  Paper • 2507.15758 • Published • 34
- Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning
  Paper • 2508.09726 • Published • 12
- BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining
  Paper • 2508.10975 • Published • 54
Rafael Coelho de Souza Krzonkalla
krzonkalla
AI & ML interests
None yet
Recent Activity
updated a collection 6 days ago: relevant_papers
upvoted a paper 6 days ago: BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining
upvoted a paper 6 days ago: Thyme: Think Beyond Images
Organizations
None yet