Commit
·
094fcb6
1
Parent(s):
6cd60e2
Changed dataset name, added sentence transformer references back in
Browse files
README.md
CHANGED
|
@@ -39,9 +39,9 @@ model-index:
|
|
| 39 |
name: Cosine Ap
|
| 40 |
---
|
| 41 |
|
| 42 |
-
#
|
| 43 |
|
| 44 |
-
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base) on the Quora
|
| 45 |
|
| 46 |
## Model Details
|
| 47 |
|
|
@@ -52,7 +52,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [A
|
|
| 52 |
- **Output Dimensionality:** 768 dimensions
|
| 53 |
- **Similarity Function:** Cosine Similarity
|
| 54 |
- **Training Dataset:**
|
| 55 |
-
- Quora
|
| 56 |
<!-- - **Language:** Unknown -->
|
| 57 |
<!-- - **License:** Unknown -->
|
| 58 |
|
|
@@ -115,17 +115,17 @@ print(similarities.shape)
|
|
| 115 |
|
| 116 |
### Training Dataset
|
| 117 |
|
| 118 |
-
####
|
| 119 |
|
| 120 |
-
* Dataset:
|
| 121 |
* Size: training samples
|
| 122 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
| 123 |
|
| 124 |
### Evaluation Dataset
|
| 125 |
|
| 126 |
-
####
|
| 127 |
|
| 128 |
-
* Dataset:
|
| 129 |
* Size: evaluation samples
|
| 130 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
| 131 |
|
|
@@ -133,15 +133,18 @@ print(similarities.shape)
|
|
| 133 |
|
| 134 |
### BibTeX
|
| 135 |
|
|
|
|
|
|
|
| 136 |
#### Sentence Transformers
|
| 137 |
```bibtex
|
| 138 |
-
@inproceedings{
|
| 139 |
-
title = "",
|
| 140 |
-
author = "",
|
| 141 |
-
|
| 142 |
-
|
| 143 |
-
|
| 144 |
-
|
|
|
|
| 145 |
}
|
| 146 |
```
|
| 147 |
|
|
|
|
| 39 |
name: Cosine Ap
|
| 40 |
---
|
| 41 |
|
| 42 |
+
# Redis Semantic Caching embedding model based on Alibaba-NLP/gte-modernbert-base
|
| 43 |
|
| 44 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base) on the [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity for the purpose of semantic caching.
|
| 45 |
|
| 46 |
## Model Details
|
| 47 |
|
|
|
|
| 52 |
- **Output Dimensionality:** 768 dimensions
|
| 53 |
- **Similarity Function:** Cosine Similarity
|
| 54 |
- **Training Dataset:**
|
| 55 |
+
- [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
| 56 |
<!-- - **Language:** Unknown -->
|
| 57 |
<!-- - **License:** Unknown -->
|
| 58 |
|
|
|
|
| 115 |
|
| 116 |
### Training Dataset
|
| 117 |
|
| 118 |
+
#### Quora
|
| 119 |
|
| 120 |
+
* Dataset: [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
| 121 |
* Size: training samples
|
| 122 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
| 123 |
|
| 124 |
### Evaluation Dataset
|
| 125 |
|
| 126 |
+
#### Quora
|
| 127 |
|
| 128 |
+
* Dataset: [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
| 129 |
* Size: evaluation samples
|
| 130 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
| 131 |
|
|
|
|
| 133 |
|
| 134 |
### BibTeX
|
| 135 |
|
| 136 |
+
#### Redis Langcache-embed Models
|
| 137 |
+
|
| 138 |
#### Sentence Transformers
|
| 139 |
```bibtex
|
| 140 |
+
@inproceedings{reimers-2019-sentence-bert,
|
| 141 |
+
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 142 |
+
author = "Reimers, Nils and Gurevych, Iryna",
|
| 143 |
+
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 144 |
+
month = "11",
|
| 145 |
+
year = "2019",
|
| 146 |
+
publisher = "Association for Computational Linguistics",
|
| 147 |
+
url = "https://arxiv.org/abs/1908.10084",
|
| 148 |
}
|
| 149 |
```
|
| 150 |
|