Zheng
HuggingJerry
AI & ML interests
None yet
Recent Activity
new activity
21 days ago
Qwen/README: Potential Issue: Load Balancing Loss May Mask Per-Layer Expert Imbalances
liked a model
about 1 year ago
deepseek-ai/deepseek-coder-33b-instruct
liked a model
over 1 year ago
mistralai/Mixtral-8x7B-Instruct-v0.1
Organizations
None yet
models
0
None public yet
datasets
0
None public yet