Chen Cui
cuichenx
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
deepseek-ai/DeepSeek-V3
new activity
4 days ago
deepseek-ai/DeepSeek-V3:`aux_loss_alpha` should be 1e-4 instead of 1e-3?
updated
a model
4 days ago
deepseek-ai/DeepSeek-V3-Base
Organizations
cuichenx's activity
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 4 days ago
by
cuichenx
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#60 opened 4 days ago
by
cuichenx