Enpei Zhao's picture

1 3

Enpei Zhao

enpeizhao

·

AI & ML interests

None yet

Recent Activity

updated a Space about 19 hours ago

enpeizhao/VLM_ODD_Online_Demo

published a Space 5 days ago

enpeizhao/VLM_ODD_Online_Demo

replied to sergiopaniego's post 5 days ago

Yet Another New Multimodal Fine-Tuning Recipe 🥧 🧑‍🍳 In this @HuggingFace Face Cookbook notebook, we demonstrate how to align a multimodal model (VLM) using Mixed Preference Optimization (MPO) using trl. 💡 This recipe is powered by the new MPO support in trl, enabled through a recent upgrade to the DPO trainer! We align the multimodal model using multiple optimization objectives (losses), guided by a preference dataset (chosen vs. rejected multimodal pairs). Check it out! ➡️ https://huggingface.co/learn/cookbook/fine_tuning_vlm_mpo

View all activity

Organizations

None yet

liked a model 5 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.84M • • 1.08k

liked a model about 2 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 412k • • 1.49k

liked a model 4 months ago

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 503k • • 411