Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZeroNihl 's Collections
Yuren

Yuren

updated Jun 3
Upvote
1

  • Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

    Paper • 2506.01939 • Published Jun 2 • 177

  • SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

    Paper • 2506.01844 • Published Jun 2 • 128

  • Taming LLMs by Scaling Learning Rates with Gradient Grouping

    Paper • 2506.01049 • Published Jun 1 • 38

  • ARIA: Training Language Agents with Intention-Driven Reward Aggregation

    Paper • 2506.00539 • Published May 31 • 30

  • Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

    Paper • 2506.01943 • Published Jun 2 • 24

  • SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

    Paper • 2506.01713 • Published Jun 2 • 47

  • AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

    Paper • 2505.24298 • Published May 30 • 27

  • MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning

    Paper • 2505.24846 • Published May 30 • 15

  • Esoteric Language Models

    Paper • 2506.01928 • Published Jun 2 • 9
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略