GHPO: Adaptive Guidance for Stable and Efficient LLM Reinforcement Learning • Paper 2507.10628 • Published Jul 14