Model hub for LLM-Neo, including Llama3.1-Neo-1B-100w and Minitron-4B-Depth-Neo-10w.
Rummy
yang31210999
AI & ML interests
None yet
Recent Activity
Upvoted a collection 3 days ago: Llama 4
New activity 13 days ago: google/gemma-3-27b-it:evals (PT vs IT)
Organizations
None yet
Collections
1
Models
5
yang31210999/Llama3.1-1B-Neo-BAAI-1000k • Text Generation • Updated • 39 • 2
yang31210999/Llama-3.1-Minitron-4B-Depth-Neo-BAAI-100k • Text Generation • Updated • 24 • 1
yang31210999/Llama-3.2-1B-Instruct-Neo-BAAI-10k • Text Generation • Updated • 38
yang31210999/H200-pile-0.01-15-10-5-neo-rank64-lr2e-4 • Updated
yang31210999/1023-eval-matmulfree-370M-ckpt27 • Updated
Datasets
None public yet