Ju He

turkeyju

https://tacju.github.io/

TACJu

Recent Activity

upvoted a paper 2 days ago

Seed1.5-VL Technical Report

published a model 7 days ago

turkeyju/code

authored a paper 10 days ago

PartImageNet: A Large, High-Quality Dataset of Parts

turkeyju's activity

upvoted a paper 2 days ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 4 days ago • 117

upvoted a paper 14 days ago

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Paper • 2504.21855 • Published 15 days ago • 12

upvoted 4 papers 27 days ago

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published 29 days ago • 33

Antidistillation Sampling

Paper • 2504.13146 • Published 28 days ago • 61

Perception Encoder: The best visual embeddings are not at the output of the network

Paper • 2504.13181 • Published 28 days ago • 34

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published 29 days ago • 48

upvoted a paper 28 days ago

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published Apr 14 • 21

upvoted a paper 29 days ago

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published about 1 month ago • 59

upvoted 12 papers about 1 month ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 131

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 255

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11 • 39

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published Apr 11 • 47

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 123

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published Mar 11 • 12

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 73

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8 • 81

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published Apr 2 • 37

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 160

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 109

Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published Apr 5 • 77