ARIA: Training Language Agents with Intention-Driven Reward Aggregation Paper • 2506.00539 • Published 12 days ago • 29
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 17 days ago • 64
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Paper • 2505.19914 • Published 17 days ago • 42
Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models Paper • 2410.13413 • Published Oct 17, 2024
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals Paper • 2406.04784 • Published Jun 7, 2024 • 2
TravelAgent: An AI Assistant for Personalized Travel Planning Paper • 2409.08069 • Published Sep 12, 2024
From Persona to Personalization: A Survey on Role-Playing Language Agents Paper • 2404.18231 • Published Apr 28, 2024 • 1
TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation Paper • 2402.05733 • Published Feb 8, 2024
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals Paper • 2406.04784 • Published Jun 7, 2024 • 2
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 128
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning Paper • 2504.13914 • Published Apr 10 • 1
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Paper • 2505.19914 • Published 17 days ago • 42