When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding Paper • 2508.15641 • Published about 23 hours ago • 2
Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models Paper • 2508.15202 • Published 1 day ago • 2
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists Paper • 2508.15126 • Published 1 day ago • 4
"Does the cafe entrance look accessible? Where is the door?" Towards Geospatial AI Agents for Visual Inquiries Paper • 2508.15752 • Published about 20 hours ago • 3
ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling Paper • 2508.15767 • Published about 20 hours ago • 5
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published about 20 hours ago • 9
Waver: Wave Your Way to Lifelike Video Generation Paper • 2508.15761 • Published about 20 hours ago • 9
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published about 20 hours ago • 151
SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Paper • 2508.15769 • Published about 20 hours ago • 10
Mobile-Agent-v3: Foundamental Agents for GUI Automation Paper • 2508.15144 • Published 1 day ago • 33
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Paper • 2503.06749 • Published Mar 9 • 32
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting Paper • 2508.11408 • Published 7 days ago • 6
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers Paper • 2508.14704 • Published 2 days ago • 23
From AI for Science to Agentic Science: A Survey on Autonomous Scientific Discovery Paper • 2508.14111 • Published 4 days ago • 24
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models Paper • 2508.13491 • Published 3 days ago • 55
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 2 days ago • 65
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published 2 days ago • 19
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs Paper • 2508.14896 • Published 2 days ago • 19