SheldonXu

Sheldoooon

https://sheldontsui.github.io/

SheldonTsui

AI & ML interests

3D-aware image synthesis

Recent Activity

upvoted a paper 5 days ago

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

upvoted a paper 6 days ago

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

upvoted a paper 11 days ago

RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation

View all activity

Organizations

None yet

upvoted a paper 5 days ago

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

Paper • 2507.01945 • Published 6 days ago • 71

upvoted a paper 6 days ago

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published 17 days ago • 60

upvoted 2 papers 11 days ago

RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation

Paper • 2506.18088 • Published 16 days ago • 17

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published 14 days ago • 56

upvoted a paper 13 days ago

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Paper • 2506.18882 • Published 15 days ago • 84

upvoted 3 papers 14 days ago

Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details

Paper • 2506.16504 • Published 19 days ago • 23

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Paper • 2506.15442 • Published 20 days ago • 12

ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies

Paper • 2506.14315 • Published 21 days ago • 10

upvoted a paper 19 days ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published 20 days ago • 62

upvoted a paper 20 days ago

Test3R: Learning to Reconstruct 3D at Test Time

Paper • 2506.13750 • Published 22 days ago • 27

upvoted 4 papers 21 days ago

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Paper • 2506.07961 • Published 29 days ago • 12

upvoted 3 papers 29 days ago

Geometry-Editable and Appearance-Preserving Object Compositon

Paper • 2505.20914 • Published May 27 • 6

FlexPainter: Flexible and Multi-View Consistent Texture Generation

Paper • 2506.02620 • Published Jun 3 • 14

Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation

Paper • 2506.04225 • Published Jun 4 • 25

upvoted 3 papers about 1 month ago

LayerFlow: A Unified Model for Layer-aware Video Generation

Paper • 2506.04228 • Published Jun 4 • 13

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Paper • 2506.03135 • Published Jun 3 • 37

BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models

Paper • 2505.22865 • Published May 28 • 2

SheldonXu

AI & ML interests

Recent Activity

Organizations

Sheldoooon's activity