LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published 20 days ago • 11
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published 19 days ago • 61
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Paper • 2505.12929 • Published May 19 • 3