Bill H's picture

2 6 51

Bill H

lccurious

·

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Energy-based Automated Model Evaluation

authored a paper 5 days ago

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

authored a paper 5 days ago

Reinforcement Learning with Rubric Anchors

View all activity

Organizations

upvoted a paper 6 days ago

Reinforcement Learning with Rubric Anchors

Paper • 2508.12790 • Published 7 days ago • 8

upvoted a paper about 1 month ago

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 38

upvoted an article 6 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 295

upvoted a collection 6 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 164

upvoted an article 7 months ago

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 305

upvoted a collection about 1 year ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 246