Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17
20
25
Shizhe Diao
shizhediao2
Follow
21world's profile picture
xx18's profile picture
research4pan's profile picture
6 followers
ยท
11 following
https://shizhediao.github.io/
shizhediao
shizhediao
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
liked
a dataset
5 days ago
OptimalScale/ClimbMix
liked
a model
12 days ago
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
upvoted
a
paper
12 days ago
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
View all activity
Organizations
models
1
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B
Updated
May 14
datasets
0
None public yet