Add pipeline tag, library name, link to code and copy GitHub README
This PR ensures the model is tagged correctly so that people can find it under the "feature-extraction" tag. It also adds the
library name so that an inference widget is displayed in the top right corner, and copies the content of the GitHub README to the model card.
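As a rough illustration of what the metadata change amounts to, the sketch below reads the front matter block of a model card and looks up the two new keys. The helper `parse_front_matter` and the constant `README_HEAD` are hypothetical names made up for this example; they are not part of the PR or of any Hugging Face library, and the parser only handles flat `key: value` pairs like the ones added here.

```python
# Hypothetical helper, not part of this PR: reads the YAML front matter of a
# model card without external dependencies. It only understands flat
# `key: value` pairs, which is all the metadata block in this PR contains.

README_HEAD = """\
---
license: apache-2.0
pipeline_tag: feature-extraction
library_name: transformers
---

# Set-Encoder
"""


def parse_front_matter(text: str) -> dict[str, str]:
    """Return the key/value pairs between the leading '---' delimiters."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter at the top of the file
    metadata = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return {}  # opening delimiter without a closing one


metadata = parse_front_matter(README_HEAD)
print(metadata["pipeline_tag"])
print(metadata["library_name"])
```

A real check would more likely use a YAML parser; the hand-rolled version is used here only to keep the example dependency-free.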
README.md
CHANGED
@@ -1,3 +1,32 @@
----
-license: apache-2.0
-
+---
+license: apache-2.0
+pipeline_tag: feature-extraction
+library_name: transformers
+---
+
+# Set-Encoder
+
+This repository contains the code for the paper: [`Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders`](https://arxiv.org/abs/2404.06912).
+
+We use [`lightning-ir`](https://github.com/webis-de/lightning-ir) to train and fine-tune models. Download and install the library to use the code in this repository.
+
+## Model Zoo
+
+We provide the following pre-trained models:
+
+| Model Name | TREC DL 19 (BM25) | TREC DL 20 (BM25) | TREC DL 19 (ColBERTv2) | TREC DL 20 (ColBERTv2) |
+| ------------------------------------------------------------------- | ----------------- | ----------------- | ---------------------- | ---------------------- |
+| [set-encoder-base](https://huggingface.co/webis/set-encoder-base) | 0.724 | 0.710 | 0.788 | 0.777 |
+| [set-encoder-large](https://huggingface.co/webis/set-encoder-large) | 0.727 | 0.735 | 0.789 | 0.790 |
+
+## Inference
+
+We recommend using the `lightning-ir` cli to run inference. The following command can be used to run inference using the `set-encoder-base` model on the TREC DL 19 and TREC DL 20 datasets:
+
+```bash
+lightning-ir re_rank --config configs/re-rank.yaml --config configs/set-encoder-finetuned.yaml --config configs/trec-dl.yaml
+```
+
+## Fine-Tuning
+
+WIP