Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
yamatazen 's Collections
LLM merging
Multilingual LLMs
Japanese LLMs
LLM censorship
LLM leaderboards
Grokking

Grokking

updated Feb 4
Upvote
1

  • Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

    Paper • 2405.15071 • Published May 23, 2024 • 42

  • Grokking at the Edge of Numerical Stability

    Paper • 2501.04697 • Published Jan 8 • 2
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs