Jinyang Wu

Jinyang23

https://orcid.org/my-orcid?orcid=0009-0006-0220-616X

jinyangwu

AI & ML interests

large language models, reasoning, agentic rl

Organizations

None yet

upvoted 3 papers 2 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published 9 days ago • 9

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 5 days ago • 31

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published 5 days ago • 4

upvoted 2 papers 4 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 9 days ago • 147

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 5 days ago • 200

upvoted a paper 5 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 8 days ago • 12

upvoted a paper 9 days ago

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 10 days ago • 22

upvoted 2 papers 23 days ago

MAXS: Meta-Adaptive Exploration with LLM Agents

Paper • 2601.09259 • Published 24 days ago • 95

A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Paper • 2601.09274 • Published 24 days ago • 84

upvoted a paper about 1 month ago

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published about 1 month ago • 42

upvoted a paper about 2 months ago

HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices

Paper • 2512.14052 • Published Dec 16, 2025 • 42

upvoted a paper 2 months ago

From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

Paper • 2512.02580 • Published Dec 2, 2025 • 28

upvoted a paper 4 months ago

Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints

Paper • 2510.08549 • Published Oct 9, 2025 • 7

upvoted a paper 9 months ago

Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities

Paper • 2505.15692 • Published May 21, 2025 • 14

upvoted a paper 11 months ago

DReSS: Data-driven Regularized Structured Streamlining for Large Language Models

Paper • 2501.17905 • Published Jan 29, 2025 • 2

upvoted 3 papers about 1 year ago

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published Feb 4, 2025 • 23

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 288

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Paper • 2411.18478 • Published Nov 27, 2024 • 37
