arxiv:2412.05271
Zhaoyang Liu
zyliu
AI & ML interests
Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC
Recent Activity
authored
a paper
about 1 month ago
InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots
Beyond Language
authored
a paper
about 1 month ago
Learning Human Motion Representations: A Unified Perspective
authored
a paper
about 1 month ago
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model
for Hundreds of Vision-Language Tasks
Organizations
spaces
2
models
11
zyliu/tmp_model11
Updated
•
3
zyliu/tmp_model10
Updated
•
2
zyliu/tmp_model9
Updated
•
1
zyliu/vllm3_tmp1
Updated
•
4
zyliu/tmp_model8
Updated
•
2
zyliu/tmp_model7
Updated
•
1
zyliu/tmp_model6
Updated
•
3
zyliu/tmp_model5
Updated
•
1
zyliu/tmp_model4
Updated
•
13
zyliu/tmp_gen_edit_model
Updated
•
16
datasets
None public yet