lole25/phi-2-dpo-ultrafeedback-lora
