TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence Paper • 2505.24500 • Published 13 days ago • 11
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper • 2503.21696 • Published Mar 27 • 22
CLIP-AD: A Language-Guided Staged Dual-Path Model for Zero-shot Anomaly Detection Paper • 2311.00453 • Published Nov 1, 2023
LLaVA-KD: A Framework of Distilling Multimodal Large Language Models Paper • 2410.16236 • Published Oct 21, 2024
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network Paper • 2411.15941 • Published Nov 24, 2024 • 2
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection Paper • 2404.06564 • Published Apr 9, 2024
On the Trajectory Regularity of ODE-based Diffusion Sampling Paper • 2405.11326 • Published May 18, 2024 • 1
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection Paper • 2308.14286 • Published Aug 28, 2023
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model Paper • 2406.19905 • Published Jun 28, 2024