DeBERTa v3 Large News Classifier

This model is a fine-tuned version of microsoft/deberta-v3-large for multi-class classification of news headlines. The model classifies headlines into one of 7 categories:

  • World
  • Business
  • Technology
  • Entertainment
  • Sports
  • Science
  • Health

Training Dataset

This model was trained on the logicalqubit/news_133k dataset,
which contains 133,000 labeled news headlines across the 7 categories mentioned above.


Training Hyperparameters:

  • learning_rate: 6e-6

  • train_batch_size: 8

  • eval_batch_size: 8

  • warmup_steps: 50

  • num_epochs: 2

  • report_to: wandb


Sample Code

Demo Code


Sample Output

Demo Output


Metrics

  • While zero-shot models and fine-tuned text classification model differ fundamentally in design and purpose, this comparison is presented for reference. It provides a practical benchmark, even if the comparison is entirely unfair.

Metrics

Metrics


Wandb

Wandb

Wandb


Downloads last month
30
Safetensors
Model size
435M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for logicalqubit/deberta-v3-large-news-classifier

Finetuned
(164)
this model

Dataset used to train logicalqubit/deberta-v3-large-news-classifier