RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents Paper • 2507.03112 • Published 9 days ago • 31
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published 20 days ago • 32 • 8