---
license: cc-by-4.0
tags:
- multi-label-classification
- text-classification
- onnx
- web-classification
- firefox-ai
- preview
language:
- multilingual
datasets:
- tshasan/multi-label-web-classification
base_model: Alibaba-NLP/gte-modernbert-base
pipeline_tag: text-classification
---
# modernBERT-URLTITLE-classifier-preview
## Model Overview
This is a **preview version** of a multi-label web classification model fine-tuned from `Alibaba-NLP/gte-modernbert-base`. It assigns one or more categories to a website based on its URL and title. The model supports 11 labels: `Uncatergorized`, `News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, and `Travel`.
- **Developed by**: Taimur Hasan
- **Model Type**: Multi-label Text Classification
- **Status**: Preview (under active development)
### Architecture
- **Fine-tuning**: Unfroze the last 4 encoder layers and the pooler
- **Problem Type**: Multi-label classification
- **Output Labels**: 11 (`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, `Travel`, `Uncatergorized`)
- **Input Format**: Concatenated string: `"{url}:{title}"`
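
The input format and multi-label output can be illustrated with a minimal sketch. It assumes the model emits one logit per label, that each logit is passed through an independent sigmoid, and that a label is kept when its probability reaches 0.5. The threshold, the label order (copied from the **Output Labels** list), and the helper names `format_input` and `decode` are illustrative assumptions, not part of the released model; check the model's config for the actual label mapping.

```python
import math

# Assumed label order, copied from the "Output Labels" list above.
# Verify against the model's id2label mapping before relying on it.
LABELS = [
    "News", "Entertainment", "Shop", "Chat", "Education", "Government",
    "Health", "Technology", "Work", "Travel", "Uncatergorized",
]


def format_input(url: str, title: str) -> str:
    """Build the concatenated "{url}:{title}" string the model expects."""
    return f"{url}:{title}"


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode(logits, threshold=0.5):
    """Turn per-label logits into the list of predicted labels.

    Multi-label: each label is scored independently, so zero, one,
    or several labels may be returned. The 0.5 threshold is an assumption.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    return [
        label
        for label, logit in zip(LABELS, logits)
        if sigmoid(logit) >= threshold
    ]


# Hypothetical logits, for illustration only (not real model output).
example_logits = [2.0, -3.0, -3.0, -3.0, -3.0, -3.0, -3.0, 1.0, -3.0, -3.0, -3.0]
text = format_input("https://example.com/news", "Example News")
predicted = decode(example_logits)
```

With these made-up logits, only the entries for `News` and `Technology` are positive, so those are the two labels that pass the threshold. In practice the logits would come from running the tokenized `text` through the model.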