Slava's picture

16

Slava

wertlon

slava-qw

AI & ML interests

CV, GenAI

Recent Activity

upvoted a paper about 22 hours ago

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

upvoted a paper about 23 hours ago

Whole-Body Conditioned Egocentric Video Prediction

upvoted a paper about 23 hours ago

HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

View all activity

Organizations

None yet

upvoted a paper about 22 hours ago

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published 22 days ago • 68

upvoted 3 papers about 23 hours ago

Whole-Body Conditioned Egocentric Video Prediction

Paper • 2506.21552 • Published 1 day ago • 4

HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

Paper • 2506.20452 • Published 3 days ago • 10

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models

Paper • 2506.19103 • Published 4 days ago • 38

upvoted a paper 2 days ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published 5 days ago • 59

upvoted 4 papers 3 days ago

Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales

Paper • 2506.19713 • Published 4 days ago • 12

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution

Paper • 2506.19838 • Published 3 days ago • 11

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published 3 days ago • 24

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published 3 days ago • 50

upvoted 3 papers 4 days ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published 4 days ago • 66

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Paper • 2506.18903 • Published 4 days ago • 18

Auto-Regressively Generating Multi-View Consistent Images

Paper • 2506.18527 • Published 5 days ago • 6

upvoted 3 papers 5 days ago

ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies

Paper • 2506.14315 • Published 11 days ago • 10

Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details

Paper • 2506.16504 • Published 8 days ago • 20

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Paper • 2506.17201 • Published 7 days ago • 43

upvoted a paper 8 months ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 210