Update README.md
Browse files
README.md
CHANGED
|
@@ -1,23 +1,31 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
language:
|
| 4 |
-
- en
|
| 5 |
-
pipeline_tag: feature-extraction
|
| 6 |
tags:
|
| 7 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
---
|
| 9 |
-
|
| 10 |
<br><br>
|
| 11 |
|
| 12 |
<p align="center">
|
| 13 |
-
<img src="https://
|
| 14 |
</p>
|
| 15 |
|
| 16 |
|
| 17 |
<p align="center">
|
| 18 |
-
<b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a
|
| 19 |
</p>
|
| 20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
|
| 22 |
## Intended Usage & Model Info
|
| 23 |
|
|
@@ -35,17 +43,11 @@ This makes our model useful for a range of use cases, especially when processing
|
|
| 35 |
This model has 137 million parameters, which enables fast and memory efficient inference, while delivering impressive performance.
|
| 36 |
Additionally, we provide the following embedding models:
|
| 37 |
|
| 38 |
-
**V1 (Based on T5, 512 Seq)**
|
| 39 |
-
|
| 40 |
-
- [`jina-embeddings-v1-small-en`](https://huggingface.co/jinaai/jina-embedding-s-en-v1): 35 million parameters.
|
| 41 |
-
- [`jina-embeddings-v1-base-en`](https://huggingface.co/jinaai/jina-embedding-b-en-v1): 110 million parameters.
|
| 42 |
-
- [`jina-embeddings-v1-large-en`](https://huggingface.co/jinaai/jina-embedding-l-en-v1): 330 million parameters.
|
| 43 |
-
|
| 44 |
-
**V2 (Based on JinaBert, 8k Seq)**
|
| 45 |
-
|
| 46 |
- [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
|
| 47 |
-
- [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters
|
| 48 |
-
- [`jina-embeddings-v2-base-
|
|
|
|
|
|
|
| 49 |
|
| 50 |
**<details><summary>Supported (Programming) Languages</summary>**
|
| 51 |
<p>
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
tags:
|
| 3 |
+
- sentence-transformers
|
| 4 |
+
- feature-extraction
|
| 5 |
+
- sentence-similarity
|
| 6 |
+
- mteb
|
| 7 |
+
datasets:
|
| 8 |
+
- allenai/c4
|
| 9 |
+
language: en
|
| 10 |
+
inference: false
|
| 11 |
+
license: apache-2.0
|
| 12 |
---
|
| 13 |
+
<!-- TODO: add evaluation results here -->
|
| 14 |
<br><br>
|
| 15 |
|
| 16 |
<p align="center">
|
| 17 |
+
<img src="https://aeiljuispo.cloudimg.io/v7/https://cdn-uploads.huggingface.co/production/uploads/603763514de52ff951d89793/AFoybzd5lpBQXEBrQHuTt.png?w=200&h=200&f=face" alt="Finetuner logo: Finetuner helps you to create experiments in order to improve embeddings on search tasks. It accompanies you to deliver the last mile of performance-tuning for neural search applications." width="150px">
|
| 18 |
</p>
|
| 19 |
|
| 20 |
|
| 21 |
<p align="center">
|
| 22 |
+
<b>The text embedding set trained by <a href="https://jina.ai/"><b>Jina AI</b></a>.</b>
|
| 23 |
</p>
|
| 24 |
|
| 25 |
+
## Quick Start
|
| 26 |
+
|
| 27 |
+
The easiest way to starting using `jina-embeddings-v2-base-en` is to use Jina AI's [Embedding API](https://jina.ai/embeddings/).
|
| 28 |
+
|
| 29 |
|
| 30 |
## Intended Usage & Model Info
|
| 31 |
|
|
|
|
| 43 |
This model has 137 million parameters, which enables fast and memory efficient inference, while delivering impressive performance.
|
| 44 |
Additionally, we provide the following embedding models:
|
| 45 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
- [`jina-embeddings-v2-small-en`](https://huggingface.co/jinaai/jina-embeddings-v2-small-en): 33 million parameters.
|
| 47 |
+
- [`jina-embeddings-v2-base-en`](https://huggingface.co/jinaai/jina-embeddings-v2-base-en): 137 million parameters **(you are here)**.
|
| 48 |
+
- [`jina-embeddings-v2-base-zh`](https://huggingface.co/jinaai/jina-embeddings-v2-base-zh): Chinese-English Bilingual embeddings.
|
| 49 |
+
- [`jina-embeddings-v2-base-de`](https://huggingface.co/jinaai/jina-embeddings-v2-base-de): German-English Bilingual embeddings.
|
| 50 |
+
- [`jina-embeddings-v2-base-es`](https://huggingface.co/jinaai/jina-embeddings-v2-base-es): Spanish-English Bilingual embeddings (soon).
|
| 51 |
|
| 52 |
**<details><summary>Supported (Programming) Languages</summary>**
|
| 53 |
<p>
|