Xi Yang's picture

Xi Yang

ianyeung

·

IanYeung

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

mHC: Manifold-Constrained Hyper-Connections

liked a model 9 days ago

stdstu123/Yume-5B-720P

liked a model 10 days ago

Wan-AI/Wan2.2-TI2V-5B-Diffusers

View all activity

Organizations

None yet

upvoted a paper 4 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 8 days ago • 224

upvoted a collection 20 days ago

VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 23 days ago • 39

upvoted a paper 21 days ago

Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

Paper • 2512.15110 • Published 22 days ago • 8

upvoted a paper 22 days ago

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Paper • 2512.14614 • Published 23 days ago • 67

upvoted an article about 1 month ago

Article

Diffusers welcomes FLUX-2

+6

Nov 25, 2025

•

168

upvoted a paper about 1 month ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published Nov 22, 2025 • 37

upvoted a paper about 2 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 227

upvoted a collection about 2 months ago

Wan2.1-Fun-V1.1

6 items • Updated Oct 9, 2025 • 8

upvoted a paper 2 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 119

upvoted a collection 3 months ago

Qwen3-VL

37 items • Updated 8 days ago • 559

upvoted a paper 3 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165

upvoted a collection 4 months ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 447

upvoted 4 papers 4 months ago

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Paper • 2509.09676 • Published Sep 11, 2025 • 33

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published Aug 21, 2025 • 20

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted 4 papers 5 months ago

Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models

Paper • 2508.12945 • Published Aug 18, 2025 • 14

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Paper • 2508.13009 • Published Aug 18, 2025 • 25

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

Thyme: Think Beyond Images

Paper • 2508.11630 • Published Aug 15, 2025 • 81