
Quan Sun

QuanSun
  • Quan-Sun

AI & ML interests

Deep Learning, Foundation Model

Organizations

Beijing Academy of Artificial Intelligence, StepFun

upvoted a collection 5 months ago

NextStep-1

Collection
9 items • Updated 11 days ago • 31
upvoted a paper 5 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 145
upvoted 2 papers 6 months ago

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Paper • 2507.05255 • Published Jul 7, 2025 • 74

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11, 2025 • 61
upvoted a paper 8 months ago

End-to-End Vision Tokenizer Tuning

Paper • 2505.10562 • Published May 15, 2025 • 22
upvoted 2 papers over 1 year ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 96

Diffusion Feedback Helps CLIP See Better

Paper • 2407.20171 • Published Jul 29, 2024 • 36
upvoted 2 papers about 2 years ago

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 36

CapsFusion: Rethinking Image-Text Data at Scale

Paper • 2310.20550 • Published Oct 31, 2023 • 27