llavallava
/
smolvlm-instruct-trl-dpo-0_0.1_epochs1_ref

Model card Files Files and versions Metrics Training metrics Community