|
--- |
|
license: cc-by-4.0 |
|
tags: |
|
- multi-label-classification |
|
- text-classification |
|
- onnx |
|
- web-classification |
|
- firefox-ai |
|
- preview |
|
language: |
|
- multilingual |
|
datasets: |
|
- tshasan/multi-label-web-classification |
|
base_model: Alibaba-NLP/gte-modernbert-base |
|
pipeline_tag: text-classification |
|
--- |
|
|
|
# modernBERT-URLTITLE-classifier-preview |
|
|
|
## Model Overview |
|
|
|
This is a **preview version** of a multi-label web classification model fine-tuned from `Alibaba-NLP/gte-modernbert-base`. It classifies websites into multiple categories based on their URLs and titles. The model supports 11 labels: `Uncatergorized`,`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, and `Travel`. |
|
|
|
- **Developed by**: Taimur Hasan |
|
- **Model Type**: Multi-label Text Classification |
|
- **Status**: Preview (under active development |
|
### Architecture |
|
- **Fine-tuning**: Unfroze the last 4 encoder layers and the pooler |
|
- **Problem Type**: Multi-label classification |
|
- **Output Labels**: 11 (`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, `Travel`,`Uncatergorized`) |
|
- **Input Format**: Concatenated string: `"{url}:{title}"` |
|
|