Zihao Yue PRO

yuezih

https://yuezih.github.io/

AI & ML interests

Multimodality

Recent Activity

liked a model 12 days ago

XiaomiMiMo/MiMo-VL-7B-SFT-2508

Organizations

None yet

upvoted a paper about 1 month ago

DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning

Paper • 2508.05405 • Published about 1 month ago • 63

upvoted a paper about 2 months ago

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos

Paper • 2507.15597 • Published Jul 21 • 33

upvoted a paper 3 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4 • 79

upvoted 2 papers 4 months ago

VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning

Paper • 2505.12081 • Published May 17 • 18

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

upvoted a paper 6 months ago

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 69

upvoted 2 papers 9 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation

Paper • 2306.13460 • Published Jun 23, 2023 • 1