view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 7 days ago • 37
view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 125
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python By erikkaum and 6 others • Oct 22, 2024 • 44
view article Article TGI Multi-LoRA: Deploy Once, Serve 30 Models By derek-thomas and 2 others • Jul 18, 2024 • 60
view article Article From OpenAI to Open LLMs with Messages API By andrewrreed and 3 others • Feb 8, 2024 • 20