view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 51
view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement Nov 7, 2025 • 4
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 • 88
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One Jun 26, 2025 • 48
view article Article How Long Prompts Block Other Requests - Optimizing LLM Performance Jun 12, 2025 • 8
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 • 58
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 187
view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl Apr 4, 2025 • 15
view article Article The case for specialized pre-training: ultra-fast foundation models for dedicated tasks Aug 4, 2024 • 30
Scotch & SOTA 🥃 Pt. 7: Human Feedback Datasets 🫣 Collection The elusive “human” feedback • 1 item • Updated Sep 13, 2023 • 1
Scotch & SOTA 🥃 Pt. 6: Dialogue Tuning Datasets 💬 Collection Conversations, turn-based dialog, and things that can be turned into that. • 4 items • Updated Sep 13, 2023 • 1
Scotch & SOTA 🥃 Pt. 5: Instruction Tuning Datasets 👩🏫 Collection Question & answer, task completion, general SFT and otherwise finetuney data. • 7 items • Updated Sep 13, 2023 • 1