πŸ† TrOCR Fine-Tuned Model (Handwritten Text Recognition)

πŸ“Œ Model Overview

This is a fine-tuned Microsoft TrOCR Large model for handwritten text recognition. It has been trained on a dataset containing scanned handwritten documents.

  • Base Model: Microsoft TrOCR Large
  • Fine-tuned On: IAM Handwritten Dataset
  • Use Case: Extract text from scanned handwritten documents
  • Framework: PyTorch + Transformers (Hugging Face)
  • Large File Support: Uses git-lfs for model files

πŸš€ How to Use This Model

You can load and use the fine-tuned model with transformers in Python as follows:

from transformers import TrOCRProcessor, VisionEncoderDecoderModel
from PIL import Image

# Load model and processor
processor = TrOCRProcessor.from_pretrained("Gitesh2003/TrOCR")
model = VisionEncoderDecoderModel.from_pretrained("Gitesh2003/TrOCR")

# Load an image
image = Image.open("handwritten_sample.jpg").convert("RGB")

# Process and predict text
pixel_values = processor(images=image, return_tensors="pt").pixel_values
generated_ids = model.generate(pixel_values)
extracted_text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]

print("Extracted Text:", extracted_text)
Downloads last month
28
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support