Article: PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick • dlouapre • 7 days ago • 6
Article: Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP • ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 7 days ago • 42
Article: Optimum Intel 2.0: An OpenVINO-First Toolkit for Running Open Models on Intel • jeffboudier • 7 days ago • 5
Article: Designing the hf CLI as an agent-optimized way to work with the Hub • celinah, Wauplin • 14 days ago • 57
Article: Introducing Storage Buckets on the Hugging Face Hub • Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 196
Article: GGML and llama.cpp join HF to ensure the long-term progress of Local AI • ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507
Collection: Apertus LLM • Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 355
Article: DeepMath: A lightweight math reasoning Agent with smolagents • danf, mber, moshew • Dec 4, 2025 • 40
Article: Continuous batching from first principles • ror, ArthurZ, mcpotato • Nov 25, 2025 • 405
Article: Running Large Transformer Models on Mobile and Edge Devices • tugrulkaya • Nov 3, 2025 • 13
Article: Streaming datasets: 100x More Efficient • andito, lhoestq, burtenshaw, pcuenq, merve • Oct 27, 2025 • 86
Article: huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning • Wauplin, celinah, lysandre, julien-c • Oct 27, 2025 • 76
Article: Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face • Jiqing, MatrixYao, kding1, IlyasMoutawwakil • Oct 16, 2025 • 18
Article: Get your VLM running in 3 simple steps on Intel CPUs • ezelanza, helenai, nikita-savelyev-intel, echarlaix, IlyasMoutawwakil • Oct 15, 2025 • 22
Article: Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models • imargulis, ofirzaf, sguskin, guybd, pcuenq • Sep 29, 2025 • 25
Article: Building the Hugging Face MCP Server • evalstate, julien-c, coyotte508, abidlabs • Jul 10, 2025 • 67
Article: Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders • thomwolf, matthieu-lapeyre • Jul 9, 2025 • 803