Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 4 days ago • 127
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control Paper • 2506.01943 • Published 4 days ago • 23
Scaling Image and Video Generation via Test-Time Evolutionary Search Paper • 2505.17618 • Published 14 days ago • 39
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published 29 days ago • 78