🎛️Fine-tuning LLMs
Collection
Collection of fine-tuned LLMs.
•
29 items
•
Updated
•
1
This model is a fine-tuned version of openai/gpt-oss-20b on the FreedomIntelligence/medical-o1-verifiable-problem dataset. It has been trained using TRL.
from transformers import pipeline
prompt = """<|start|>developer<|message|># Instructions
You are a medical expert with advanced knowledge in clinical reasoning and diagnostics. Respond with ONLY the final diagnosis/cause in ≤5 words.<|end|><|start|>user<|message|>An 88-year-old woman with osteoarthritis is experiencing mild epigastric discomfort and has vomited material resembling coffee grounds multiple times. Considering her use of naproxen, what is the most likely cause of her gastrointestinal blood loss?<|end|><|start|>assistant<|message|>"""
# Load pipeline
generator = pipeline(
"text-generation",
model="kingabzpro/gpt-oss-20b-medical-qa",
device="cuda" # or device=0
)
# Run inference (passing in chat-style format)
output = generator(
prompt,
max_new_tokens=20,
return_full_text=False
)[0]
print(output["generated_text"])
Output:
NSAID use
This model was trained with SFT.
Base model
openai/gpt-oss-20b