Wenqi Zhang's picture

Wenqi Zhang

zwq2018

·

zwq2018

AI & ML interests

LLM, Multimodal, Robotics

Recent Activity

upvoted a paper 20 days ago

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

upvoted a paper about 1 month ago

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence

upvoted a paper about 1 month ago

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation

View all activity

Organizations

authored a paper 3 months ago

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Paper • 2503.21696 • Published Mar 27 • 22

authored a paper 6 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107

authored 3 papers 12 months ago

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Paper • 2306.07209 • Published Jun 12, 2023 • 2

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

Paper • 2401.02009 • Published Jan 4, 2024 • 1

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 48