SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 10 days ago • 160
ABC: Achieving Better Control of Multimodal Embeddings using VLMs Paper • 2503.00329 • Published Mar 1 • 18
SwarmFormer Collection Our collection of our frontier SwarmFormer architecture models. • 2 items • Updated Jan 24 • 3
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 17
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 57
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 24
T3M: Text Guided 3D Human Motion Synthesis from Speech Paper • 2408.12885 • Published Aug 23, 2024 • 13
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 124
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 13
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 32