Yanhong Zeng's picture

Yanhong Zeng

zengyh1900

·

https://zengyh1900.github.io/

AI & ML interests

Generative AI for Content Creation.

Recent Activity

upvoted a paper 7 days ago

Calligrapher: Freestyle Text Image Customization

authored a paper 3 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

liked a model 6 months ago

internlm/internlm3-8b-instruct-gptq-int4

View all activity

Organizations

None yet

authored a paper 3 months ago

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Paper • 2503.19990 • Published Mar 25 • 34

authored a paper 7 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 49

authored 2 papers 12 months ago

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Paper • 2407.17438 • Published Jul 24, 2024 • 27

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Paper • 2407.08701 • Published Jul 11, 2024 • 12

authored 3 papers about 1 year ago

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Paper • 2406.20085 • Published Jun 28, 2024 • 13

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

Paper • 2407.01494 • Published Jul 1, 2024 • 15

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Paper • 2406.17758 • Published Jun 25, 2024 • 19

authored a paper over 1 year ago

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

Paper • 2312.13964 • Published Dec 21, 2023 • 20