- ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
  Paper • 2510.22037 • Published • 18
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 467
- The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
  Paper • 2509.26507 • Published • 525
- Scaling Language-Centric Omnimodal Representation Learning
  Paper • 2510.11693 • Published • 97
Clément Castellon
Clemspace
AI & ML interests
Reinforcement learning, Neural Architecture Search, Transformers
Recent Activity
Updated a collection 9 days ago: Bangers 2025