GRAM: A Generative Foundation Reward Model for Reward Generalization • Paper • 2506.14175 • Published 15 days ago • 1
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data • Paper • 2408.12109 • Published Aug 22, 2024 • 1
A Controlled Study on Long Context Extension and Generalization in LLMs • Paper • 2409.12181 • Published Sep 18, 2024 • 46