@sergiopaniego on Hugging Face: "This summer TRL leveled up for multimodal alignment 🌞 ✅ New VLM alignment…"

sergiopaniego

posted an update 9 days ago

This summer TRL leveled up for multimodal alignment 🌞

✅ New VLM alignment methods (MPO, GRPO, GSPO)
✅ Extended RLOO & Online DPO for VLMs
✅ Native SFT support
✅ Ready-to-use training scripts

🔗 https://huggingface.co/blog/trl-vlm-alignment

sergiopaniego Sergio Paniego