CREAM: Consistency Regularized Self-Rewarding Language Models Paper • 2410.12735 • Published Oct 16, 2024
REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training Paper • 2505.16792 • Published May 22