Hugging Face

Enpei Zhao

enpeizhao

AI & ML interests

None yet

Recent Activity

updated a Space about 19 hours ago
enpeizhao/VLM_ODD_Online_Demo
published a Space 5 days ago
enpeizhao/VLM_ODD_Online_Demo
replied to sergiopaniego's post 5 days ago
Yet Another New Multimodal Fine-Tuning Recipe 🥧 🧑‍🍳 In this Hugging Face Cookbook notebook, we demonstrate how to align a multimodal model (VLM) using Mixed Preference Optimization (MPO) with trl. 💡 This recipe is powered by the new MPO support in trl, enabled through a recent upgrade to the DPO trainer! We align the multimodal model using multiple optimization objectives (losses), guided by a preference dataset (chosen vs. rejected multimodal pairs). Check it out! ➡️ https://huggingface.co/learn/cookbook/fine_tuning_vlm_mpo

Organizations

None yet

liked a model 5 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.84M • 1.08k
liked a model about 2 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 412k • 1.49k
liked a model 4 months ago

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 503k • 411