-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2605.06130
-
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Paper • 2407.20798 • Published • 24 -
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper • 2412.16145 • Published • 38 -
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Paper • 2501.03262 • Published • 104 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75
-
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Paper • 2605.12500 • Published • 185 -
From Context to Skills: Can Language Models Learn from Context Skillfully?
Paper • 2604.27660 • Published • 162 -
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
Paper • 2605.03849 • Published • 124 -
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
Paper • 2605.03042 • Published • 119
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147
-
Graph Neural Network Training with Data Tiering
Paper • 2111.05894 • Published -
Graph Neural Networks are Dynamic Programmers
Paper • 2203.15544 • Published • 1 -
Graph Neural Networks for Jamming Source Localization
Paper • 2506.03196 • Published -
Code as Agent Harness
Paper • 2605.18747 • Published • 193
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 76 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning
Paper • 2407.20798 • Published • 24 -
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Paper • 2412.16145 • Published • 38 -
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
Paper • 2501.03262 • Published • 104 -
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution
Paper • 2502.18449 • Published • 75
-
Graph Neural Network Training with Data Tiering
Paper • 2111.05894 • Published -
Graph Neural Networks are Dynamic Programmers
Paper • 2203.15544 • Published • 1 -
Graph Neural Networks for Jamming Source Localization
Paper • 2506.03196 • Published -
Code as Agent Harness
Paper • 2605.18747 • Published • 193
-
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Paper • 2605.12500 • Published • 185 -
From Context to Skills: Can Language Models Learn from Context Skillfully?
Paper • 2604.27660 • Published • 162 -
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
Paper • 2605.03849 • Published • 124 -
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
Paper • 2605.03042 • Published • 119
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 76 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147