DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 3 days ago • 222
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 17 days ago • 69
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 20 days ago • 473
view article Article cocogold: training Marigold for text-grounded segmentation By pcuenq • Jul 8 • 30
HAI-DEF Concept Apps Collection Collection of concept apps built around HAI-DEF open models/libraries to inspire the community. Learn more at http://goo.gle/hai-def` • 6 items • Updated 25 days ago • 31
Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10 • 48
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 206
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 454
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 208
— UI is a good thing 💅 — Collection cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 23
Foundation Models for Vision 🧩 Collection Foundation models for computer vision. • 24 items • Updated Mar 11, 2024 • 20
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated Aug 24, 2024 • 21
view article Article Open-Source Handwritten Signature Detection Model By samuellimabraz • Mar 14 • 117
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated Feb 28 • 109