Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published 21 days ago • 43
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Paper • 2503.12937 • Published 21 days ago • 27
Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks Paper • 2503.11514 • Published 25 days ago • 15
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Paper • 2502.19328 • Published Feb 26 • 22
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning Paper • 2504.00891 • Published 5 days ago • 10
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 6 days ago • 159