1 53 26

Kyu Song

kyunocap

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Waver: Wave Your Way to Lifelike Video Generation

upvoted a paper 3 days ago

EdgeFusion: On-Device Text-to-Image Generation

liked a model 5 days ago

Qwen/Qwen-Image-Edit

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published 3 days ago • 22

upvoted a paper 3 days ago

EdgeFusion: On-Device Text-to-Image Generation

Paper • 2404.11925 • Published Apr 18, 2024 • 24

liked a model 5 days ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated 6 days ago • 36.4k • • 1.19k

upvoted a paper 6 days ago

DINOv3

Paper • 2508.10104 • Published 11 days ago • 183

upvoted a paper 11 days ago

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published 13 days ago • 12

upvoted a paper 13 days ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published 18 days ago • 57

upvoted a paper 19 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published 20 days ago • 216

liked a dataset 23 days ago

jungjee/spoofceleb

Updated Nov 24, 2024 • 188 • 8

liked 2 models 27 days ago

Wan-AI/Wan2.2-T2V-A14B

Text-to-Video • Updated 17 days ago • 14.5k • • 243

Wan-AI/Wan2.2-I2V-A14B

Image-to-Video • Updated 17 days ago • 11k • • 248

upvoted a paper 30 days ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 120

liked a Space about 1 month ago

2.6k

Anycoder

🏢

Generate modern HTML from existing code

upvoted 5 papers about 1 month ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

DreamPoster: A Unified Framework for Image-Conditioned Generative Poster Design

Paper • 2507.04218 • Published Jul 6 • 12

From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation

Paper • 2507.08924 • Published Jul 11 • 17

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation

Paper • 2507.09862 • Published Jul 14 • 49

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models

Paper • 2507.08128 • Published Jul 10 • 9

New activity in jt-zhang/SageAttention2_plus about 2 months ago

It seems that sm90 cannot use sageattention2++ kernel

#2 opened about 2 months ago by

kyunocap

liked a model 2 months ago

jt-zhang/SageAttention2_plus

Updated Jul 18 • 19

liked a Space 2 months ago

1.04k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training