Update README.md
Browse files
README.md
CHANGED
|
@@ -46,7 +46,7 @@ We report **Normalized Discounted Cumulative Gain (NDCG)** scores, which measure
|
|
| 46 |
- **Avg. NDCG**: Average of NDCG@1, @3, @5, and @10 across all benchmark datasets.
|
| 47 |
- **NDCG@k**: Relevance quality of the top-*k* retrieved results.
|
| 48 |
|
| 49 |
-
####
|
| 50 |
Our model, **telepix/PIXIE-Rune-Preview**, achieves state-of-the-art performance across most metrics and benchmarks, demonstrating strong generalization across domains such as multi-hop QA, long-document retrieval, public health, and e-commerce.
|
| 51 |
|
| 52 |
| Model Name | # params | Avg. NDCG | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
|
|
@@ -78,7 +78,7 @@ Descriptions of the benchmark datasets used for evaluation are as follows:
|
|
| 78 |
- **XPQARetrieval**
|
| 79 |
A real-world dataset constructed from user queries and relevant product documents in a Korean e-commerce platform.
|
| 80 |
|
| 81 |
-
####
|
| 82 |
Our model, **telepix/PIXIE-Rune-Preview**, achieves strong performance on a wide range of tasks, including fact verification, multi-hop question answering, financial QA, and scientific document retrieval, demonstrating competitive generalization across diverse domains.
|
| 83 |
|
| 84 |
| Model Name | # params | Avg. NDCG | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
|
|
|
|
| 46 |
- **Avg. NDCG**: Average of NDCG@1, @3, @5, and @10 across all benchmark datasets.
|
| 47 |
- **NDCG@k**: Relevance quality of the top-*k* retrieved results.
|
| 48 |
|
| 49 |
+
#### 7 Datasets of MTEB (Korean)
|
| 50 |
Our model, **telepix/PIXIE-Rune-Preview**, achieves state-of-the-art performance across most metrics and benchmarks, demonstrating strong generalization across domains such as multi-hop QA, long-document retrieval, public health, and e-commerce.
|
| 51 |
|
| 52 |
| Model Name | # params | Avg. NDCG | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
|
|
|
|
| 78 |
- **XPQARetrieval**
|
| 79 |
A real-world dataset constructed from user queries and relevant product documents in a Korean e-commerce platform.
|
| 80 |
|
| 81 |
+
#### 7 Datasets of BEIR (English)
|
| 82 |
Our model, **telepix/PIXIE-Rune-Preview**, achieves strong performance on a wide range of tasks, including fact verification, multi-hop question answering, financial QA, and scientific document retrieval, demonstrating competitive generalization across diverse domains.
|
| 83 |
|
| 84 |
| Model Name | # params | Avg. NDCG | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
|