view article Article AI Policy: π€ Response to the White House AI Action Plan RFI 5 days ago β’ 20
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets 6 days ago β’ 27
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper β’ 2503.01743 β’ Published 20 days ago β’ 77
Token-Efficient Long Video Understanding for Multimodal LLMs Paper β’ 2503.04130 β’ Published 18 days ago β’ 83
RuCCoD: Towards Automated ICD Coding in Russian Paper β’ 2502.21263 β’ Published 23 days ago β’ 122
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper β’ 2503.03601 β’ Published 18 days ago β’ 215
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published Feb 4 β’ 208
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper β’ 2502.08946 β’ Published Feb 13 β’ 186
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper β’ 2502.01061 β’ Published Feb 3 β’ 191
Training Language Models to Self-Correct via Reinforcement Learning Paper β’ 2409.12917 β’ Published Sep 19, 2024 β’ 138
Addition is All You Need for Energy-efficient Language Models Paper β’ 2410.00907 β’ Published Oct 1, 2024 β’ 148
CLEAR: Character Unlearning in Textual and Visual Modalities Paper β’ 2410.18057 β’ Published Oct 23, 2024 β’ 208
ROICtrl: Boosting Instance Control for Visual Generation Paper β’ 2411.17949 β’ Published Nov 27, 2024 β’ 84
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published Nov 7, 2024 β’ 120