Can Qin
canqin001
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents
authored a paper 4 months ago
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models
upvoted a paper 9 months ago
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs