1 30 6

Peter

Tempo14

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

upvoted an article 18 days ago

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

upvoted an article 18 days ago

Transformers Are Getting Old: Variants and Alternatives Exist!

View all activity

Organizations

upvoted 2 articles 18 days ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

29 days ago

• 34

Article

Transformers Are Getting Old: Variants and Alternatives Exist!

•

21 days ago

• 42

upvoted a paper 28 days ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 30 days ago • 28

upvoted 2 papers 2 months ago

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19 • 34

Latent Flow Transformer

Paper • 2505.14513 • Published May 20 • 28

upvoted an article 2 months ago

Article

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

and 9 others •

May 15

• 35

upvoted an article 3 months ago

Article

Technical Framework for Building an AGI

•

May 10

• 2

upvoted 3 articles 5 months ago

Article

What changed in the Transformer architecture

•

Mar 8

• 15

Article

LeRobot goes to driving school: World’s largest open-source self-driving dataset

and 1 other •

Mar 11

• 97

Article

My Learning Journey: Understanding C

•

Mar 9

• 4

upvoted 7 papers 5 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 36

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 41

upvoted an article 6 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

and 3 others •

Feb 4

• 167

upvoted 2 papers 6 months ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31 • 22

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 125

Peter

AI & ML interests

Recent Activity

Organizations

Tempo14's activity

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

Transformers Are Getting Old: Variants and Alternatives Exist!

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

Technical Framework for Building an AGI

What changed in the Transformer architecture

LeRobot goes to driving school: World’s largest open-source self-driving dataset

My Learning Journey: Understanding C

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control