NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 4 days ago • 25
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 1 day ago • 77
Nested Learning: The Illusion of Deep Learning Architectures Paper • 2512.24695 • Published 8 days ago • 31
DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer Paper • 2601.01425 • Published 4 days ago • 41
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models Paper • 2512.22238 • Published 16 days ago • 18
Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting Paper • 2512.20927 • Published 15 days ago • 7
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published 13 days ago • 57
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 10 days ago • 93
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 10 days ago • 64
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 16 days ago • 60
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 22 days ago • 29
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation Paper • 2512.17040 • Published 21 days ago • 27
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published 21 days ago • 42
Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Paper • 2512.11251 • Published 27 days ago • 6
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published 21 days ago • 37