Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,18 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language: en
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
# Sparse BERT base model (uncased)
|
| 6 |
+
|
| 7 |
+
Pretrained model pruned to 1:2 structured sparsity.
|
| 8 |
+
The model is a pruned version of the [BERT base model](https://huggingface.co/bert-base-uncased).
|
| 9 |
+
|
| 10 |
+
## Intended Use
|
| 11 |
+
|
| 12 |
+
The model can be used for fine-tuning to downstream tasks with sparsity already embeded to the model.
|
| 13 |
+
To keep the sparsity a mask should be added to each sparse weight blocking the optimizer from updating the zeros.
|
| 14 |
+
|
| 15 |
+
## Evaluation Results
|
| 16 |
+
| Task | MNLI-m (Acc) | MNLI-mm (Acc) | QQP (Acc/F1) | QNLI (Acc) | SST-2 (Acc) | STS-B (Pears/Spear) | SQuADv1.1 (Acc/F1) |
|
| 17 |
+
|------|--------------|---------------|--------------|------------|-------------|---------------------|--------------------|
|
| 18 |
+
| | 83.3 | 83.9 | 90.8/87.6 | 90.4 | 91.3 | 88.8/88.3 | 80.5/88.2 |
|