Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17
20
25
Shizhe Diao
shizhediao2
Follow
bunyaminergen's profile picture
research4pan's profile picture
21world's profile picture
6 followers
·
11 following
https://shizhediao.github.io/
shizhediao
shizhediao
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
liked
a dataset
7 days ago
OptimalScale/ClimbMix
liked
a model
13 days ago
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
upvoted
a
paper
14 days ago
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
View all activity
Organizations
shizhediao2
's models
1
Sort: Recently updated
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B
Updated
May 14