BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining Paper • 2508.10975 • Published 10 days ago • 53
view article Article 🇵🇭 FilBench - Can LLMs Understand and Generate Filipino? By ljvmiranda921 and 8 others • 13 days ago • 13
view article Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 12 days ago • 11
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 654
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 636
view article Article LLM Hallucinations: bug or feature? The US Supreme Court 2025 cases experiment By dvilasuero • Jul 8 • 18
view article Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • Jul 2 • 16
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others • Jun 23 • 53
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 53
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana • May 26 • 44
view changelog Changelog Xet is now the default storage option for new users and organizations May 23 • 72
view article Article Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • May 21 • 34
view article Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • May 19 • 26
view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others • May 15 • 117
view article Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • May 11 • 74
view article Article Introducing Spaces Dev Mode for a seamless developer experience By pagezyhf • May 21, 2024 • 15
view article Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 488