Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
krzonkalla 's Collections
relevant_papers

relevant_papers

updated 6 days ago
Upvote
-

  • Group Sequence Policy Optimization

    Paper • 2507.18071 • Published Jul 24 • 290

  • LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

    Paper • 2507.15758 • Published Jul 21 • 34

  • Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

    Paper • 2508.09726 • Published 12 days ago • 12

  • BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

    Paper • 2508.10975 • Published 10 days ago • 54
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略