Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper 1 day ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

liked a model 2 days ago

deepseek-ai/DeepSeek-V3.1

liked a model 2 days ago

CohereLabs/command-a-reasoning-08-2025

View all activity

Organizations

upvoted a paper 1 day ago

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published 3 days ago • 33

upvoted a paper 2 days ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published 4 days ago • 29

upvoted an article 4 days ago

Article

Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era

By

and 1 other •

4 days ago

• 12

upvoted a paper 5 days ago

τ^2-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Paper • 2506.07982 • Published Jun 9 • 6

upvoted an article 5 days ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

By

and 1 other •

7 days ago

• 35

upvoted an article 10 days ago

Article

Announcing the Synthetic Online Conversations Dataset (SOC)

By

•

12 days ago

• 11

upvoted a paper 12 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published 16 days ago • 156

upvoted 2 articles 16 days ago

Article

The GPT-OSS models are here… and they’re energy-efficient!

By

•

17 days ago

• 19

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By

and 4 others •

17 days ago

• 51

upvoted an article 17 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 323

upvoted a collection 17 days ago

IFBench

Datasets for IFBench benchmark and paper! • 3 items • Updated Jul 3 • 5

upvoted an article 19 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

20 days ago

• 472

upvoted a collection 19 days ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 17 days ago • 316

upvoted an article 21 days ago

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 846

upvoted an article 24 days ago

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

24 days ago

• 63

upvoted an article 26 days ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

27 days ago

• 159

upvoted a paper 27 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 290

upvoted an article 28 days ago

Article

Parquet Content-Defined Chunking

By

•

about 1 month ago

• 61

upvoted a paper 30 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 280

upvoted a collection about 1 month ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 12 items • Updated 19 days ago • 71