PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model Paper • 1803.08225 • Published Mar 22, 2018
Three Pillars improving Vision Foundation Model Distillation for Lidar Paper • 2310.17504 • Published Oct 26, 2023
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning Paper • 2012.11552 • Published Dec 21, 2020
Localizing Objects with Self-Supervised Transformers and no Labels Paper • 2109.14279 • Published Sep 29, 2021
Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation Paper • 2203.11160 • Published Mar 21, 2022
What to Hide from Your Students: Attention-Guided Masked Image Modeling Paper • 2203.12719 • Published Mar 23, 2022
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data Paper • 2203.16258 • Published Mar 30, 2022
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Paper • 2407.10964 • Published Jul 15, 2024
SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers Paper • 2312.00648 • Published Dec 1, 2023
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers Paper • 2501.08303 • Published Jan 14, 2025
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling Paper • 2502.09509 • Published Feb 13, 2025
Unsupervised Representation Learning by Predicting Image Rotations Paper • 1803.07728 • Published Mar 21, 2018