luokai's picture

41 213

luokai

iamluokai

·

iamluokai

AI & ML interests

None yet

Recent Activity

liked a Space about 24 hours ago

hujiecpp/PE3R

upvoted a paper 5 days ago

SkyReels-A2: Compose Anything in Video Diffusion Transformers

liked a model 5 days ago

Skywork/SkyReels-A2

View all activity

Organizations

iamluokai's activity

upvoted a paper 5 days ago

SkyReels-A2: Compose Anything in Video Diffusion Transformers

Paper • 2504.02436 • Published 6 days ago • 29

upvoted a paper 23 days ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published 25 days ago • 131

upvoted a collection 27 days ago

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated 8 days ago • 97

upvoted a collection about 1 month ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 20 days ago • 103

upvoted a collection about 2 months ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 3 days ago • 216

upvoted a paper about 2 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 55

upvoted 2 papers 3 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 92

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

upvoted 3 collections 4 months ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 6 days ago • 146

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 24 days ago • 90

CogVideo

10 items • Updated about 19 hours ago • 50

upvoted a paper 5 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 75

upvoted a collection 5 months ago

LongVU

7 items • Updated Oct 31, 2024 • 30

upvoted a paper 6 months ago

Framer: Interactive Frame Interpolation

Paper • 2410.18978 • Published Oct 24, 2024 • 38

upvoted a collection 6 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 26 days ago • 300

upvoted 2 papers 7 months ago

LVCD: Reference-based Lineart Video Colorization with Diffusion Models

Paper • 2409.12960 • Published Sep 19, 2024 • 25

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13, 2024 • 52

upvoted a collection 8 months ago

Jamba 1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Mar 6 • 87

upvoted a paper 8 months ago

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12, 2024 • 15