Collections

Discover the best community collections!

Collections including paper arxiv:2605.06130

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 70
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 38
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

Reinforcement learning

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

Paper • 2407.20798 • Published Jul 30, 2024 • 24
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 10 days ago • 185
From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 19 days ago • 162
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

Paper • 2605.03849 • Published 17 days ago • 124
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 18 days ago • 119

about 18 hours ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published Mar 27 • 144
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 15 days ago • 110

about 18 hours ago

Graph Neural Network Training with Data Tiering

Paper • 2111.05894 • Published Nov 10, 2021
Graph Neural Networks are Dynamic Programmers

Paper • 2203.15544 • Published Mar 29, 2022 • 1
Graph Neural Networks for Jamming Source Localization

Paper • 2506.03196 • Published Jun 1, 2025
Code as Agent Harness

Paper • 2605.18747 • Published 4 days ago • 193

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Paper • 2604.18543 • Published Apr 20 • 30
Near-Future Policy Optimization

Paper • 2604.20733 • Published about 1 month ago • 76
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks

Paper • 2604.20987 • Published about 1 month ago • 21
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering

Paper • 2602.23161 • Published Feb 26
