2 14 3

Oooowi

ZiruiZheng

zhengzirui

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Hzzone/GLIGEN_COCO

upvoted a paper about 2 months ago

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

commented on a paper about 2 months ago

Towards Physically Plausible Video Generation via VLM Planning

View all activity

Organizations

ZiruiZheng's activity

liked a model about 1 month ago

Hzzone/GLIGEN_COCO

Updated May 10, 2024 • 2 • 1

upvoted a paper about 2 months ago

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Paper • 2504.01934 • Published Apr 2 • 23

commented a paper about 2 months ago

Towards Physically Plausible Video Generation via VLM Planning

Paper • 2503.23368 • Published Mar 30 • 40 •

upvoted 2 papers about 2 months ago

Towards Physically Plausible Video Generation via VLM Planning

Paper • 2503.23368 • Published Mar 30 • 40

AMD-Hummingbird: Towards an Efficient Text-to-Video Model

Paper • 2503.18559 • Published Mar 24 • 5

upvoted 2 papers 3 months ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published Feb 27 • 30

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Paper • 2502.08639 • Published Feb 12 • 43

upvoted a paper 5 months ago

Autoregressive Video Generation without Vector Quantization

Paper • 2412.14169 • Published Dec 18, 2024 • 14

New activity in stabilityai/stable-diffusion-3.5-medium 5 months ago

Doesn't work boys - we'll get 'em next time. FIX INSIDE

#10 opened 7 months ago by

mushroomfleet

upvoted 2 papers 6 months ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published Nov 26, 2024 • 21

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 55

updated a collection 8 months ago

in-context learning

Collection

2 items • Updated Sep 10, 2024

upvoted a paper 8 months ago

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6, 2024 • 26

liked a Space 8 months ago

1.51k

InstructPix2Pix

🚀

Transform images based on text instructions

updated 2 collections 9 months ago

text-to-image

Collection

1 item • Updated Aug 12, 2024

representation

Collection

2 items • Updated Aug 12, 2024

liked a Space 9 months ago

SD 3 Medium GPU

👁

Generate stunning images from text prompts