jinaai
/

jina-embeddings-v4

Visual Document Retrieval

sentence-transformers

feature-extraction

multimodal-embedding

multilingual-embedding

Text-to-Visual Document (T→VD) retrieval

sentence-similarity

🇪🇺 Region: EU

Model card Files Files and versions Community

michael-guenther commited on Jun 18

Commit

9349adf

·

verified ·

1 Parent(s): 8e64b38

Update vidore_eval.md

Files changed (1) hide show

vidore_eval.md +9 -1

vidore_eval.md CHANGED Viewed

@@ -15,4 +15,12 @@ vidore-benchmark evaluate-retriever \
     --collection-name jinaai/document-screenshot-retrieval-benchmark-small-684831c022c53b21c313b449 \
     --dataset-format qa \
     --split test
-```

     --collection-name jinaai/document-screenshot-retrieval-benchmark-small-684831c022c53b21c313b449 \
     --dataset-format qa \
     --split test
+```
+## Evaluate Pure Text Retrieval Models on Refined Vidore Tasks
+The original Vidore dataset contain multiple text chunks per image to evaluate text retrieval models on them.
+Those text chunks are  extracted from the document pages using different tools like [Unstructured](https://github.com/Unstructured-IO/unstructured), OCR models, and LLMs.
+For evaluating text retrieval models on our filtered versions of the Vidore datasets, you can use the datasets in the collection `https://huggingface.co/collections/jinaai/jina-vdr-vidoreocr-tasks-6852cfc55ccf837e7fecfa1b`.
+It is also possible to evaluate jina-embeddings-v4 and other vision retrieval models on them. This however takes more time and should lead to the same evaluation results as running the vesions of the datasets in the Jina VDR collection.