Kyu Song's picture

Kyu Song

kyunocap

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Waver: Wave Your Way to Lifelike Video Generation

upvoted a paper 3 days ago

EdgeFusion: On-Device Text-to-Image Generation

liked a model 4 days ago

Qwen/Qwen-Image-Edit

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published 3 days ago • 22

upvoted a paper 3 days ago

EdgeFusion: On-Device Text-to-Image Generation

Paper • 2404.11925 • Published Apr 18, 2024 • 24

upvoted a paper 6 days ago

DINOv3

Paper • 2508.10104 • Published 11 days ago • 182

upvoted a paper 10 days ago

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published 13 days ago • 12

upvoted a paper 12 days ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published 17 days ago • 57

upvoted a paper 19 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published 20 days ago • 215

upvoted a paper 30 days ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 120

upvoted 5 papers about 1 month ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

DreamPoster: A Unified Framework for Image-Conditioned Generative Poster Design

Paper • 2507.04218 • Published Jul 6 • 12

From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation

Paper • 2507.08924 • Published Jul 11 • 17

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Paper • 2507.09862 • Published Jul 14 • 49

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

Paper • 2507.08128 • Published Jul 10 • 9

upvoted 3 papers 2 months ago

Align Your Flow: Scaling Continuous-Time Flow Map Distillation

Paper • 2506.14603 • Published Jun 17 • 20

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9 • 28

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

upvoted 5 papers 3 months ago

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Paper • 2506.06276 • Published Jun 6 • 22

Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models

Paper • 2506.00996 • Published Jun 1 • 38

ATI: Any Trajectory Instruction for Controllable Video Generation

Paper • 2505.22944 • Published May 28 • 7

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 150

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 83