Breast Cancer Classification with Swin Transformer V2 Large

This repository contains a fine-tuned Swin Transformer V2 Large model for breast cancer classification based on mammography images.

Due to the indistinguishable nature of the dataset various runs had been conducted to perform the original 3 classes classification according to the original DDSM dataset but the accuracy obtained is dismal (approx 67%) contrary to literature review of (>90%).

I have also explored dual input Swin Transformer using the Tumour Mask, however, similar dismal accuracy is obtained. We can look at the dataset and notice that the images all looks about the same except Normal. Thus, the detection strategy becomes detecting the presence of cancer by merging to Benign and Cancer images as a class against the Normal images.

With such approach, accuracy significant increases and achieve reliable performance.

Model Description

The model is based on the Swin Transformer V2 Large architecture, fine-tuned on the Mini-DDBS-JPEG dataset for breast cancer classification. It uses a custom classification head consisting of a multi-layer perceptron with dropout for regularization.

Key Features

Based on Swin Transformer V2 Large architecture
Input image size: 256x256 pixels
Binary classification task (malignant vs benign)
Mixed precision training for improved performance

Performance

The model was trained with class balancing techniques to handle data imbalance. Performance metrics on the test set:

Metric	Value
Test Accuracy	0.8644501278772379
Test Loss	0.31417657015726086

For detailed performance metrics including precision, recall, and F1-score per class, please check the training notebook.

Usage

With Transformers Pipeline

from transformers import pipeline

classifier = pipeline("image-classification", model="keanteng/swin-v2-large-ft-breast-cancer-classification-0603")
result = classifier("path/to/mammogram.jpg")
print(result)

from transformers import AutoFeatureExtractor, AutoModelForImageClassification
from PIL import Image

# Load model and feature extractor
model = AutoModelForImageClassification.from_pretrained("keanteng/swin-v2-ft-large-breast-cancer-classification-0603")
feature_extractor = AutoFeatureExtractor.from_pretrained("keanteng/swin-v2-ft-large-breast-cancer-classification-0603")

# Prepare image
image = Image.open("path/to/mammogram.jpg").convert("RGB")
inputs = feature_extractor(images=image, return_tensors="pt")

# Get prediction
outputs = model(**inputs)
predicted_class_idx = outputs.logits.argmax(-1).item()
print(f"Predicted class: model.config.id2label[predicted_class_idx]")

keanteng
/

swin-v2-large-ft-breast-cancer-classification-0603

Breast Cancer Classification with Swin Transformer V2 Large

Model Description

Key Features

Performance

Usage

With Transformers Pipeline

Model tree for keanteng/swin-v2-large-ft-breast-cancer-classification-0603

Dataset used to train keanteng/swin-v2-large-ft-breast-cancer-classification-0603

Collection including keanteng/swin-v2-large-ft-breast-cancer-classification-0603

Research Project 🧠