arxiv:2507.07999
Xiangtai Li
LXT
AI & ML interests
Computer Vision, Multi-Modal Understanding, Generative AI
Recent Activity
authored
a paper
about 1 month ago
Mixed-R1: Unified Reward Perspective For Reasoning Capability in
Multimodal Large Language Models
authored
a paper
about 1 month ago
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions
authored
a paper
about 1 month ago
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and
Methodology