Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a model about 2 hours ago

lewtun/qwen3-0.6b-sft-capybara

published a model about 2 hours ago

lewtun/qwen3-0.6b-sft-capybara

updated a dataset about 2 hours ago

lewtun/ml-intern-sessions

View all activity

Organizations

liked a Space about 3 hours ago

physics-intern: an Autonomous Agent for Physics Research

Generate autonomous research reports for physics problems

liked a model 6 days ago

Zyphra/ZAYA1-8B

9B • Updated about 21 hours ago • 66.1k • 448

liked a Space 7 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

liked a Space 8 days ago

MoE Recipe Builder

Tetris-style recipe builder for Qwen3-30B-A3B MoE training

liked 2 Spaces 11 days ago

Hutter Prize Dashboard

Dashboard for the Hutter Prize (100MB) collab

Efficient Optimizer Live

Dashboard for the Efficient Optimizer challenge

liked 2 models 14 days ago

poolside/Laguna-XS.2

Text Generation • 33B • Updated 4 days ago • 25.6k • 245

lewtun/talkie-1930-13b-it-hf

Text Generation • 13B • Updated 14 days ago • 6.41k • 23

liked a Space 14 days ago

Talkie 1930

Chat with a 1930s‑style language model

liked 2 models 15 days ago

talkie-lm/talkie-1930-13b-it

Updated 19 days ago • 260

talkie-lm/talkie-1930-13b-base

Updated 19 days ago • 85

liked a model 19 days ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 7 days ago • 2.02M • • 3.89k

liked a model 20 days ago

openai/privacy-filter

Token Classification • 1B • Updated 20 days ago • 191k • 1.42k

liked a Space 22 days ago

Traces Viewer

Explore and visualize trace logs in an interactive web viewer

liked a Space 23 days ago

Defeating the trainer-generator precision mismatch in TRL

Download research PDF (Pro access required)

liked a model 26 days ago

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 19 days ago • 3.86M • • 1.74k

liked a dataset 28 days ago

GenerTeam/pretrain_data_eukaryote

Viewer • Updated 1 day ago • 100 • 275 • 3

liked a Space about 1 month ago

Distilling 100B+ Models 40x Faster with TRL

TRL distillation for 100B+ teachers, 40x faster

liked a dataset about 1 month ago

arcinstitute/opengenome2

Preview • Updated Sep 20, 2025 • 7.67k • 140

liked a model about 1 month ago

arcee-ai/Trinity-Large-Thinking

Text Generation • 399B • Updated 4 days ago • 21.7k • • 168