StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams Paper β’ 2506.08862 β’ Published 17 days ago β’ 5
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper β’ 2403.18795 β’ Published Mar 27, 2024 β’ 21
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Paper β’ 2406.06367 β’ Published Jun 10, 2024
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior Paper β’ 2401.09050 β’ Published Jan 17, 2024
Attention Prompting on Image for Large Vision-Language Models Paper β’ 2409.17143 β’ Published Sep 25, 2024 β’ 7
Mugs: A Multi-Granular Self-Supervised Learning Framework Paper β’ 2203.14415 β’ Published Mar 27, 2022
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet Paper β’ 2101.11986 β’ Published Jan 28, 2021
ConvBERT: Improving BERT with Span-based Dynamic Convolution Paper β’ 2008.02496 β’ Published Aug 6, 2020
Emerging Properties in Unified Multimodal Pretraining Paper β’ 2505.14683 β’ Published May 20 β’ 130
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper β’ 2505.13438 β’ Published May 19 β’ 35
General-Reasoner: Advancing LLM Reasoning Across All Domains Paper β’ 2505.14652 β’ Published May 20 β’ 22
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper β’ 2505.13438 β’ Published May 19 β’ 35