Hugging Face
Edward Beeching
edbeeching
237 followers · 29 following
https://edbeeching.github.io/
edbeeching
AI & ML interests
None yet
Organizations
edbeeching's models (372)
Sort: Recently updated
edbeeching/Qwen2.5-1.5B-Open-R1-Distill-dev • Updated about 1 month ago
edbeeching/OpenR1-Distill-7B-packing-benchmarks • 8B • Updated Jun 9 • 8
edbeeching/OpenR1-Distill-7B • Text Generation • 8B • Updated Jun 7 • 8
edbeeching/SmolLM3-3B-instruct • Updated Jun 2
edbeeching/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Jun 2 • 9
edbeeching/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated May 22 • 5
edbeeching/Qwen2.5-7B-Instruct-GRPO • 8B • Updated Mar 25 • 7
edbeeching/Qwen2.5-Math-7B-Instruct-SFT • Text Generation • 8B • Updated Mar 25 • 7
edbeeching/Qwen2.5-1.5B-Open-R1-Code-GRPO • Updated Mar 11
edbeeching/Qwen2.5-Coder-3B-Instruct-sft • Text Generation • 3B • Updated Feb 22 • 5
edbeeching/pythia-1b-deduped-tldr-online-dpo • Updated Feb 19
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO • 2B • Updated Feb 7 • 4
edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Jan 30
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Jan 30
edbeeching/gkd-model-compile • Updated Oct 17, 2024
edbeeching/gkd-model-no-compile • Updated Oct 17, 2024
edbeeching/EleutherAI_pythia-1b • Text Generation • 1B • Updated Aug 1, 2024 • 14
edbeeching/EleutherAI_pythia-2.8b • Text Generation • 3B • Updated Aug 1, 2024 • 4
edbeeching/dpo_tldr_1b • Text Generation • 1B • Updated Aug 1, 2024 • 5
edbeeching/EleutherAI_pythia-6.9b • Updated Jul 26, 2024
edbeeching/online_dpo_tldr_6.9b • Text Generation • 7B • Updated Jul 25, 2024 • 3
edbeeching/dpo_tldr_6.9b • Updated Jul 25, 2024
edbeeching/vsft-llava_builder_Meta-Llama-3-8B • Image-to-Text • 8B • Updated Apr 23, 2024 • 8
edbeeching/vsft-llava_builder-meta-Llama-3-8B • Updated Apr 23, 2024
edbeeching/vsft-llava_builder_zephyr-7b-beta • Image-to-Text • 8B • Updated Apr 20, 2024 • 9
edbeeching/vsft-llava_builder • Updated Apr 19, 2024
edbeeching/atari_2B_atari_stargunner_2222 • Reinforcement Learning • Updated Apr 16, 2024
edbeeching/atari_2B_atari_stargunner_1111 • Reinforcement Learning • Updated Apr 16, 2024
edbeeching/atari_2B_atari_spaceinvaders_2222 • Reinforcement Learning • Updated Apr 16, 2024 • 1
edbeeching/atari_2B_atari_spaceinvaders_1111 • Reinforcement Learning • Updated Apr 16, 2024 • 2