Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
9
Pierre Erbacher
erbacher
Follow
AngeV's profile picture
dokutech's profile picture
2 followers
·
6 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
12 days ago
erbacher/trl-NuminaMath-LEAN
published
a dataset
12 days ago
erbacher/trl-NuminaMath-LEAN
upvoted
a
paper
about 2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
View all activity
Organizations
Papers
1
arxiv:
2410.03437
models
47
Sort: Recently updated
erbacher/wiki_categories
Updated
Mar 7
erbacher/Llama-3.2-Tulu-3-1B-SFT
1B
•
Updated
Jan 2
•
4
erbacher/llama3.2-1B-MATH
Text Generation
•
1B
•
Updated
Dec 3, 2024
•
7
erbacher/llama-1B-online-dpo
Updated
Oct 22, 2024
erbacher/zephyr-rag-agent-webgpt
Text Generation
•
3B
•
Updated
Jun 24, 2024
•
1
•
1
erbacher/zephyr-3b-rag-agent-webgpt
Updated
Jun 12, 2024
erbacher/gptneo-125M-acco-32a
Text Generation
•
0.1B
•
Updated
May 22, 2024
•
3
erbacher/gptneo-125M-ddp-32a
Text Generation
•
0.1B
•
Updated
May 22, 2024
•
2
erbacher/gpt-neo-125M-ddp
Text Generation
•
0.1B
•
Updated
May 21, 2024
•
1
erbacher/gpt-neo-125M-acco
Text Generation
•
0.1B
•
Updated
May 21, 2024
•
1
View 47 models
datasets
17
Sort: Recently updated
erbacher/trl-NuminaMath-LEAN
Viewer
•
Updated
12 days ago
•
104k
•
93
erbacher/MATH_TTT
Viewer
•
Updated
Jun 14
•
12k
•
7
erbacher/open-math-instruct-steps
Updated
Mar 4
•
2
erbacher/rag-n-roll-webgpt-deduplicated
Viewer
•
Updated
Jun 17, 2024
•
50.8k
•
2
erbacher/rag-and-roll-hagrid-deduplicated
Viewer
•
Updated
Jun 13, 2024
•
3.71k
•
5
erbacher/ragnroll-webgpt
Viewer
•
Updated
Jun 9, 2024
•
13.8k
•
4
erbacher/rag-and-roll-hagrid
Viewer
•
Updated
May 13, 2024
•
2.64k
•
3
•
1
erbacher/proactive_image_generation
Viewer
•
Updated
Feb 28, 2024
•
7.01k
•
3
erbacher/PDEBench-1D
Viewer
•
Updated
Dec 20, 2023
•
400k
•
395
erbacher/personalized-proactive-conversations
Viewer
•
Updated
Dec 18, 2023
•
28.1k
•
7
•
1
View 17 datasets