BioClinicalBERT-based Triage Classification Model Documentation
Model Overview
This documentation outlines the fine-tuned BioClinicalBERT model for medical triage classification.
Model Name: BioClinicalBERT-Triage
Base Model: emilyalsentzer/Bio_ClinicalBERT
Task: Medical triage classification
Classes: Emergency, Urgent, Non-Urgent, Routine, Follow-up
Training Dataset Size: 34,010 samples
Validation Dataset Size: 8,503 samples
Model Metrics

Final Training Loss: 0.3246
Training Samples Per Second: 13.99
Training Time: Approximately 2 hours

Model Description
This model was fine-tuned from the BioClinicalBERT foundation model to classify medical symptoms into appropriate triage categories. It's designed to support healthcare professionals in prioritizing patient care based on symptom descriptions. The model processes text descriptions of symptoms and medical history to predict one of the predefined triage categories.
How to Use
pythonfrom transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

# Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained("YourUsername/BioClinicalBERT-Triage")
model = AutoModelForSequenceClassification.from_pretrained("VolodymyrPugachov/BioClinicalBERT-Triage")

# Create classification pipeline
classifier = pipeline(
    "text-classification",
    model=model,
    tokenizer=tokenizer,
    top_k=None
)

# Example usage
symptoms = "I'm having severe chest pain that radiates to my left arm and jaw. I'm also feeling short of breath and nauseous."
medical_history = "History of high blood pressure"
text_input = f"{symptoms} {medical_history}"

# Get prediction
results = classifier(text_input)
print(results)
Limitations

The model has been trained on specific medical text data and may not generalize well to significantly different symptom descriptions or medical specialties.
It should be used as a supportive tool for healthcare professionals, not as a replacement for clinical judgment.
Performance may vary for rare or complex medical conditions not well-represented in the training data.

pythonfrom transformers import AutoModelForSequenceClassification, AutoTokenizer

# Load from Hugging Face Hub
model = AutoModelForSequenceClassification.from_pretrained("VolodymyrPugachov/BioClinicalBERT-Triage")
tokenizer = AutoTokenizer.from_pretrained("VolodymyrPugachov/BioClinicalBERT-Triage")