- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 61
- Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning
  Paper • 2508.03501 • Published • 53
- SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience
  Paper • 2508.04700 • Published • 47
- RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong Learning in Physical Embodied Systems
  Paper • 2508.01415 • Published • 7
Anni Li
neutrino12
AI & ML interests
None yet
Recent Activity
updated a model 2 days ago: neutrino12/tensorstax-sft-format-plan-mix-lr5e-5-2262
published a model 2 days ago: neutrino12/tensorstax-sft-format-plan-mix-lr5e-5-2262
upvoted an article 2 days ago: From GRPO to DAPO and GSPO: What, Why, and How