AI & ML interests

Retrieval, Computer Vision, LLM

Recent Activity

vidore 's collections 11

ViDoRe Chunk OCR (baseline)
The ViDoRe benchmark was passed to Unstructured to partition each page into text chunks. Detected figures/tables were captioned with Claude 3-Sonnet.
ViDoRe Page OCR (artifact)
ViDoRe benchmark with the full OCR text of each page. ⚠️ This dataset serves a intermediate step → use "ViDoRe Chunk OCR (baseline)" for evaluation!
ViDoRe Benchmark (BEIR)
Benchmark for document retrieval using visual features, introduced in the ColPali paper. Datasets are using the BEIR format.
ColPali Paper Resources
Main resources for the paper: "ColPali: Efficient Document Retrieval with Vision Language Models"
ViDoRe Benchmark (BEIR)
Benchmark for document retrieval using visual features, introduced in the ColPali paper. Datasets are using the BEIR format.
ViDoRe Chunk OCR (baseline)
The ViDoRe benchmark was passed to Unstructured to partition each page into text chunks. Detected figures/tables were captioned with Claude 3-Sonnet.
ColPali Paper Resources
Main resources for the paper: "ColPali: Efficient Document Retrieval with Vision Language Models"
ViDoRe Page OCR (artifact)
ViDoRe benchmark with the full OCR text of each page. ⚠️ This dataset serves a intermediate step → use "ViDoRe Chunk OCR (baseline)" for evaluation!