metadata

language: en
license: apache-2.0
tags:
  - masked-language-modeling
  - imdb
  - distilbert
  - domain-adaptation
  - tensorflow
  - movie-reviews
pipeline_tag: fill-mask
widget:
  - text: This movie was absolutely [MASK].

distilbert-base-uncased-finetuned-imdb

This is a DistilBERT model fine-tuned using Masked Language Modeling (MLM) on the IMDB movie reviews dataset.
It is domain-adapted specifically for understanding and completing movie-related text.

Model Details

Base model: distilbert-base-uncased
Training objective: Masked Language Modeling
Domain: English movie reviews (IMDB)
Framework: TensorFlow / Keras
Training time: ~3 hours on Google Colab
Chunk size: 128 tokens

Use Case

This model is ideal for:

Autocompletion of masked tokens in movie reviews
Domain-aware masked language modeling
Sentence generation or augmentation in film-related contexts

Example Usage

from transformers import pipeline

fill_mask = pipeline(
    "fill-mask",
    model="Prathamesh2403/distilbert-base-uncased-finetuned-imdb",
    tokenizer="Prathamesh2403/distilbert-base-uncased-finetuned-imdb"
)

fill_mask("This movie was absolutely [MASK].")