MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024 • 56
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023