Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
Releases August 9
Releases August 2
Releases July 25
Releases July 18
Releases July 11
Releases July 4
Releases June 27
June 20 Releases
OCR Models & Datasets
Releases June 13
Releases June 6
Releases 30 May
Releases 23 May
May 16 Releases
May 9 Releases
Any-to-Any Models, Datasets, Spaces
Releases Apr 21 & May 2
InternVL3 HF
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
Türkçe VLMler
Feb 14 Releases 💌
Feb 7 Releases 🧣
January 31 Releases 🧤
Models, Jan 27
Jan 24 Releases
Jan 17 Releases ❄️
Jan 10 Releases 🌨️
Dec 6 Releases 🎄
Nov 29 Releases 🌲🌲
Nov 22 Releases ❄️
Nov 15 Releases 🍂
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS 🪷
New Depth Models
BRAVE Models 🦁
Computer Vision Backbones 🧩
Image Classification Models 🐶 🐱
Object Detection Models 🥥
Image Segmentation Models 💜
Zero-shot Image Classification Models 🖼️
Image-to-Image Models 🎨
Video Classification Models 📺
Image-to-Text Models 📝
Text-to-Image Models 🥑
Foundation Models for Vision 🧩
Segment Anything Model
OWL-series 🦉
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers 🖼️💬📝
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Multimodal DSE Retrievers
updated
Apr 15
A collection of DSE models for multimodal retrieval
Upvote
14
+4
racineai/Flantier-SmolVLM-2B-dse
2B
•
Updated
Jun 18
•
53
•
11
MrLight/dse-qwen2-2b-mrl-v1
Visual Document Retrieval
•
Updated
Feb 26
•
8.52k
•
59
marco/mcdse-2b-v1
2B
•
Updated
Oct 29, 2024
•
5.53k
•
56
llamaindex/vdr-2b-multi-v1
Image-to-Text
•
2B
•
Updated
May 21
•
7.72k
•
118
llamaindex/vdr-2b-v1
Image-to-Text
•
2B
•
Updated
Jan 10
•
1.69k
•
13
Upvote
14
+10
Share collection
View history
Collection guide
Browse collections