ondevicellm
/
tinyllama_moe_dpo_ultrafeedback_epochs5

Model card Files Files and versions Metrics Training metrics Community