view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge NormalUhr • Feb 7, 2025 • 295
view article Article A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny • Jan 19 • 34
NbAiLab/wav2vec2-large-danish-npsc-nst Automatic Speech Recognition • 0.3B • Updated Jan 6, 2025 • 4.3k • 2
language-and-voice-lab/wav2vec2-large-xlsr-53-icelandic-ep30-967h Automatic Speech Recognition • Updated Apr 25, 2025 • 411k • 3
language-and-voice-lab/whisper-large-icelandic-62640-steps-967h Automatic Speech Recognition • Updated Apr 25, 2025 • 280 • 4
NbAiLab/nb-whisper-large-distil-turbo-beta Automatic Speech Recognition • 0.8B • Updated Sep 10, 2025 • 806 • 11
Running on CPU Upgrade Featured 3.23k The Smol Training Playbook 📚 3.23k The secrets to building world-class LLMs