view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand about 1 month ago • 63
Running on CPU Upgrade Featured 2.78k The Smol Training Playbook 📚 2.78k The secrets to building world-class LLMs
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 247
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 178