Harsh Pareek
harshhpareek
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
24 days ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
Organizations
None yet