arxiv:2412.09871
Mike Lewis
mikelewis0
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
Byte Latent Transformer: Patches Scale Better Than Tokens
authored
a paper
4 months ago
Law of the Weakest Link: Cross Capabilities of Large Language Models
authored
a paper
6 months ago
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware
Experts
Organizations
None yet
Papers
11
models
None public yet