You Li's picture

8 5 4

You Li

Michael4933

·

Michael4933

AI & ML interests

NLP, Multi-modal LLM

Organizations

None yet

upvoted 3 papers 3 months ago

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Paper • 2505.24298 • Published May 30 • 27

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL

Paper • 2505.15436 • Published May 21 • 1

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published May 20 • 2

upvoted a paper 5 months ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17 • 32

upvoted a paper 7 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Paper • 2501.05767 • Published Jan 10 • 30