Chaoqi Wang

alecwangcq

AI & ML interests

RL \cap LLMs

Authored 2 papers

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024

Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints

Paper • 2309.16240 • Published Sep 28, 2023