Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection Paper • 2605.30344 • Published 11 days ago • 1
Evaluating Cognitive Age Alignment in Interactive AI Agents Paper • 2605.17894 • Published 21 days ago • 5
3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding Paper • 2604.08645 • Published Apr 9 • 2
mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale Paper • 2506.21550 • Published Jun 26, 2025
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising Paper • 2603.19216 • Published Mar 19 • 1
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs Paper • 2603.23481 • Published Mar 24 • 7
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics Paper • 2604.08503 • Published Apr 9 • 7
Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching Paper • 2602.12221 • Published Feb 12 • 6
Uncertainty in Action: Confidence Elicitation in Embodied Agents Paper • 2503.10628 • Published Mar 13, 2025
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting Paper • 2506.17212 • Published Jun 20, 2025