tshasan's picture
Create README.md
5093161 verified
|
raw
history blame
1.19 kB
metadata
license: cc-by-4.0
tags:
  - multi-label-classification
  - text-classification
  - onnx
  - web-classification
  - firefox-ai
  - preview
language:
  - multilingual
datasets:
  - tshasan/multi-label-web-classification
base_model: Alibaba-NLP/gte-modernbert-base
pipeline_tag: text-classification

modernBERT-URLTITLE-classifier-preview

Model Overview

This is a preview version of a multi-label web classification model fine-tuned from Alibaba-NLP/gte-modernbert-base. It classifies websites into multiple categories based on their URLs and titles. The model supports 11 labels: Uncatergorized,News, Entertainment, Shop, Chat, Education, Government, Health, Technology, Work, and Travel.

  • Developed by: Taimur Hasan
  • Model Type: Multi-label Text Classification
  • Status: Preview (under active development

Architecture

  • Fine-tuning: Unfroze the last 4 encoder layers and the pooler
  • Problem Type: Multi-label classification
  • Output Labels: 11 (News, Entertainment, Shop, Chat, Education, Government, Health, Technology, Work, Travel,Uncatergorized)
  • Input Format: Concatenated string: "{url}:{title}"