Peng Wang's picture

Peng Wang

stillarrow

·

https://peter-peng-w.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset about 13 hours ago

LLM360/guru-RL-92k

upvoted an article 15 days ago

From GRPO to DAPO and GSPO: What, Why, and How

upvoted an article about 1 month ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

View all activity

Organizations

None yet

New activity in allenai/ai2_arc 8 months ago

Incorrect question answer pair

#7 opened about 1 year ago by