Quanzeng You's picture

2 6

Quanzeng You

Ye27

·

AI & ML interests

None yet

Organizations

authored a paper 8 months ago

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Paper • 2410.18666 • Published Oct 24, 2024 • 19

authored 2 papers 9 months ago

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model

Paper • 2405.17815 • Published May 28, 2024

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 51

authored a paper 10 months ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 96

authored a paper about 1 year ago

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27, 2024 • 56

authored 7 papers over 1 year ago

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding

Paper • 2403.01487 • Published Mar 3, 2024 • 16

COCO is "ALL'' You Need for Visual Instruction Fine-tuning

Paper • 2401.08968 • Published Jan 17, 2024 • 2

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning

Paper • 2401.06805 • Published Jan 10, 2024 • 2

Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark

Paper • 1605.02677 • Published May 9, 2016 • 1

CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models

Paper • 2311.11567 • Published Nov 20, 2023 • 8

Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis

Paper • 2311.17126 • Published Nov 28, 2023 • 1

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

Paper • 2310.06389 • Published Oct 10, 2023 • 1