Small casual language models trained for the evaluation of sample efficiency.

Daniel Christoph
J4bb4wukis
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 months ago
J4bb4wukis/llama_208m_wikipedia_en_shuffeld
updated
a model
about 2 months ago
J4bb4wukis/llama_360m_wikipedia_en_shuffeld
updated
a model
about 2 months ago
J4bb4wukis/llama_208m_wikipedia_en_shuffeld
Organizations
None yet
Collections
1
models
9

J4bb4wukis/llama_208m_wikipedia_en_shuffeld
Updated
•
4

J4bb4wukis/llama_360m_wikipedia_en_shuffeld
Updated
•
7

J4bb4wukis/xlstm_406m_wikipedia_en_shuffeld
Updated
•
27

J4bb4wukis/mamba2_432m_wikipedia_en_shuffeld
Updated
•
3

J4bb4wukis/gpt2_355m_wikipedia_en_shuffeld
Updated
•
3

J4bb4wukis/gpt2_209m_wikipedia_en_shuffeld
Updated
•
3

J4bb4wukis/gpt2_124m_wikipedia_en_shuffeld
Updated
•
4

J4bb4wukis/xlstm_247m_wikipedia_en_shuffeld
Updated
•
5

J4bb4wukis/mamba2_172m_wikipedia_en_shuffeld
Updated
•
3
datasets
0
None public yet