Submitted by zichenwen 52 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs · 14 authors 43 2
Submitted by korallll 44 A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models · 7 authors 9 2
Submitted by yukimasano 18 Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning · 8 authors 89 3
Submitted by nqbinh 16 CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models · 5 authors 4
Submitted by Holarissun 11 Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities · 2 authors 1
Submitted by wzk1015 10 Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models · 12 authors 58 1
Submitted by shikhar7ssu 5 OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder · 7 authors 1
Submitted by psp-dada 5 Mitigating Object Hallucinations via Sentence-Level Early Intervention · 4 authors 5 1
Submitted by Hiiamein 5 RedOne: Revealing Domain-specific LLM Post-Training in Social Networking Services · 25 authors 2
Submitted by gonzmart 4 The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations · 5 authors 1
Submitted by 0xnu 3 Quantitative Risk Management in Volatile Markets with an Expectile-Based Framework for the FTSE Index · 1 author 0 1