Zixuan Yang

Luli3220

https://luli3220.github.io/

AI & ML interests

Post-training, RL

Recent Activity

upvoted a paper 18 days ago

When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents

authored a paper 24 days ago

MERIT: Matching Expertise via Rubric-Informed Training for Reviewer Assignment

updated a model 25 days ago

Luli3220/MERIT-4B-reviewer-assessor

Organizations

None yet

upvoted a paper 18 days ago

When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents

Paper • 2606.05806 • Published 22 days ago • 23

upvoted a paper 25 days ago

MERIT: Matching Expertise via Rubric-Informed Training for Reviewer Assignment

Paper • 2605.27865 • Published 30 days ago • 1

upvoted 2 papers 28 days ago

Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents?

Paper • 2605.27881 • Published 30 days ago • 10

Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning

Paper • 2605.28424 • Published 30 days ago • 32