Submitted by WZDavid 59 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs · 20 authors 102 2
Submitted by SivilTaram 28 SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? · 8 authors 10 1
Submitted by zhendongucb 28 DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering · 3 authors 40 1
Submitted by vztu 21 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding · 7 authors 1
Submitted by Franck-Dernoncourt 10 Lizard: An Efficient Linearization Framework for Large Language Models · 12 authors 1
Submitted by crainone 8 Replacing thinking with tool usage enables reasoning in small language models · 3 authors 2
Submitted by HenghuiDing 8 AnyI2V: Animating Any Conditional Image with Motion Control · 4 authors 86 1
Submitted by Xa9aX 3 GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities · 12 authors 1
Submitted by hongzhizhang 3 RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning · 7 authors 16 1
Submitted by MatteoFasulo 2 AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles · 3 authors 2 1
Submitted by Gray1y - MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation · 6 authors 12 1