Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning Paper • 2508.10608 • Published Aug 14, 2025