pangpangxuan
pangxuan
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
Qwen3-Coder-Next Technical Report upvoted a paper 4 days ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey upvoted a paper 7 days ago
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy OptimizationOrganizations
None yet