Lyiase

lyiase

AI & ML interests

None yet

Recent Activity

liked a model 15 days ago

ByteDance-Seed/BAGEL-7B-MoT

liked a model 20 days ago

IndexTeam/Index-anisora

liked a model 21 days ago

Wan-AI/Wan2.1-VACE-14B

Organizations

None yet

lyiase's activity

upvoted 9 papers 2 months ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 92

MixerMDM: Learnable Composition of Human Motion Diffusion Models

Paper • 2504.01019 • Published Apr 1 • 19

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Paper • 2504.01016 • Published Apr 1 • 29

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 77

AvatarArtist: Open-Domain 4D Avatarization

Paper • 2503.19906 • Published Mar 25 • 9

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Paper • 2503.19901 • Published Mar 25 • 41

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 135

PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos

Paper • 2503.17973 • Published Mar 23 • 7

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

upvoted 11 papers 3 months ago

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Paper • 2503.14151 • Published Mar 18 • 10

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Paper • 2503.10522 • Published Mar 13 • 26

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128

Edit Transfer: Learning Image Editing via Vision In-Context Relations

Paper • 2503.13327 • Published Mar 17 • 29

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8 • 138

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 66

MaRI: Material Retrieval Integration across Domains

Paper • 2503.08111 • Published Mar 11 • 7

VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published Mar 14 • 21

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 141

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 165

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12 • 46