Submitted by AaronHuangWei 14 EmbRACE-3K: Embodied Reasoning and Action in Complex Environments · 9 authors 2 1
Submitted by dorni 12 SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation · 9 authors 2
Submitted by Liang0223 3 LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers · 10 authors 1
Submitted by zsytony 2 CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards · 5 authors 96 1
Submitted by raymin0223 1 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation · 11 authors 1