TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence Paper • 2505.24500 • Published 15 days ago • 11
S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners Paper • 2409.01524 • Published Sep 3, 2024 • 1
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation Paper • 2506.03139 • Published 10 days ago • 14
Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Paper • 2505.16998 • Published 22 days ago • 2
ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models Paper • 2505.21500 • Published 17 days ago • 11
Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning Paper • 2505.14684 • Published 24 days ago • 23
Let LLMs Break Free from Overthinking via Self-Braking Tuning Paper • 2505.14604 • Published 24 days ago • 23
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models Paper • 2505.15801 • Published 23 days ago • 17
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models Paper • 2503.06692 • Published Mar 9, 2025 • 2