Curated SFT datasets for instruction-following and conversational fine-tuning
Behrooz Azarkhalili
ermiaazarkhalili
AI & ML interests
LLMs, VLMs, PEFT, RL for LLMs and VLMs.
Recent Activity
Published a model 2 days ago: ermiaazarkhalili/SmolLM2-135M-Instruct-GRPO-NuminaMath-50K
Published a model 2 days ago: ermiaazarkhalili/SmolLM2-1.7B-Instruct-GRPO-NuminaMath-50K
Published a model 2 days ago: ermiaazarkhalili/LFM2-2.6B-GRPO-NuminaMath-50K