Hugging Face
Yang Zhou
PRO
YangZhoumill
0 followers · 1 following
IronSteveZhou
YangZhou08
AI & ML interests
MLSys and Efficient Deep Learning
Recent Activity
updated a model 5 days ago: YangZhoumill/mixturewithif60percentt0000040000
published a model 5 days ago: YangZhoumill/mixturewithif60percentt0000040000
updated a model 5 days ago: YangZhoumill/mixturewithif35percenttopenmathr0000040000
YangZhoumill's models (114)
YangZhoumill/mixturesmallnoif0000010000 • 3B • Updated 30 days ago • 63
YangZhoumill/mixturesmallnoif0000008000 • 3B • Updated 30 days ago • 8
YangZhoumill/mixturesmallnoif0000005000 • 3B • Updated 30 days ago • 63
YangZhoumill/mixturesmallnoif0000001000 • 3B • Updated 30 days ago • 76
YangZhoumill/mixturewithif0000018000 • 3B • Updated about 1 month ago • 448
YangZhoumill/mixturewithif0000015000 • 3B • Updated about 1 month ago • 160
YangZhoumill/mixturewithif1000 • 3B • Updated about 1 month ago • 62
YangZhoumill/Llama-3.2-3B-instruction_following_2000 • 3B • Updated Jul 25 • 7
YangZhoumill/llama3bwebonly5B • 3B • Updated Jul 11 • 8
YangZhoumill/llama3bthinkingonly5B • 3B • Updated Jul 11 • 9
YangZhoumill/remedy_llama3.2_3b • Updated Jun 21
YangZhoumill/deepscalrr_1.5b_postmath • 2B • Updated Jun 10 • 135
YangZhoumill/Qwen3-30B-A3B • Updated May 31
YangZhoumill/climb_234567_8k_step300 • 2B • Updated May 10 • 5
YangZhoumill/climb_234567_8k_step1020 • 2B • Updated May 10 • 5
YangZhoumill/climb_234567_8k • 2B • Updated May 10 • 5
YangZhoumill/scratch_234567rewritee_step300 • 2B • Updated May 5 • 5
YangZhoumill/climb_8910111213_step300 • 2B • Updated Apr 25 • 5
YangZhoumill/Qwen2.5-0.5B-Instruct • Text Generation • 0.5B • Updated Apr 24 • 5
YangZhoumill/scratch_234567_step360 • 2B • Updated Apr 23 • 6
YangZhoumill/scratch_234567_step330 • 2B • Updated Apr 23 • 5
YangZhoumill/scratch_234567_step210 • 2B • Updated Apr 23 • 5
YangZhoumill/scratch_234567_step300 • 2B • Updated Apr 23 • 5
YangZhoumill/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 0.5B • Updated Mar 26 • 5