galois77's Collections
Training optimization
Updated 4 days ago
The Curse of Depth in Large Language Models
Paper • 2502.05795 • Published Feb 9 • 40
Transformers without Normalization
Paper • 2503.10622 • Published Mar 13 • 163
Parallel Scaling Law for Language Models
Paper • 2505.10475 • Published 7 days ago • 72