10 38 121

Zijian Zhou PRO

franciszzj

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

liked a Space 12 days ago

black-forest-labs/FLUX.1-Kontext-Dev

updated a Space 17 days ago

franciszzj/Leffa

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

Paper • 2508.14811 • Published 5 days ago • 38

upvoted a paper about 1 month ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7 • 59

upvoted 2 papers 2 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18 • 65

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 263

upvoted 4 papers 3 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2 • 47

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Paper • 2505.17952 • Published May 23 • 20

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 280

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 150

upvoted a collection 4 months ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 11 days ago • 296

upvoted a paper 4 months ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30 • 47

upvoted 2 papers 5 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 138

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30 • 95

upvoted a collection 5 months ago

FLUX.1

Collection

A collection of our FLUX.1 models and LoRAs. • 10 items • Updated 19 days ago • 191

upvoted 5 papers 6 months ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 55

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 45

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 64

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

upvoted a paper 8 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection 8 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Jul 21 • 224

Zijian Zhou PRO

AI & ML interests

Recent Activity

Organizations

franciszzj's activity