RogerZhuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

LLMs Get Lost In Multi-Turn Conversation

liked a Space 13 days ago

ResembleAI/Chatterbox

liked a model 13 days ago

deepseek-ai/DeepSeek-R1-0528

RogerZhuo's activity

upvoted a paper 7 days ago

LLMs Get Lost In Multi-Turn Conversation

Paper • 2505.06120 • Published May 9 • 6

upvoted a paper 14 days ago

OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data

Paper • 2505.18445 • Published 18 days ago • 63

upvoted 2 papers 18 days ago

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Paper • 2505.16707 • Published 20 days ago • 42

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published 19 days ago • 116

upvoted a paper 19 days ago

Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation

Paper • 2503.18429 • Published Mar 24 • 2

upvoted a paper 28 days ago

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Paper • 2505.07747 • Published 29 days ago • 60

upvoted a collection 30 days ago

SkyReels-V2

Collection

Infinite-length Film Generative Model • 9 items • Updated Apr 24 • 41

upvoted 2 papers about 2 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 99

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 88

upvoted a collection about 2 months ago

Animagine XL 4.0

Collection

The Ultimate Anime-themed SDXL Model • 3 items • Updated Feb 13 • 11

upvoted a paper about 2 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 85

upvoted a paper 2 months ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 129

upvoted a collection 2 months ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65

upvoted 6 papers 2 months ago

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published Apr 3 • 46

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

Paper • 2405.04233 • Published May 7, 2024 • 2

Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis

Paper • 2411.01156 • Published Nov 2, 2024 • 7

upvoted a paper 3 months ago

Reinforcement Learning: An Overview

Paper • 2412.05265 • Published Dec 6, 2024 • 8