deng
xiaodong123
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
liked a Space 6 months ago
Qwen/Qwen2-VL
liked a model over 1 year ago
Qwen/Qwen-7B
Organizations
models
None public yet
datasets
None public yet