SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 10 days ago • 160
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay Paper • 2504.03601 • Published 13 days ago • 15
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 17 days ago • 241
MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization Paper • 2503.16874 • Published 27 days ago • 44
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 30 days ago • 137
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published Mar 16 • 43
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5 • 228
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published Mar 6 • 93
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 62
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published Feb 27 • 30