Mu Cai
mucai
AI & ML interests
Computer Vision, Deep Learning, 3D Vision, Vision and Language,
Recent Activity
upvoted
a
paper
about 2 months ago
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token
Compression across Images, Videos, and Audios
upvoted
a
paper
8 months ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training