Characterizing signal propagation to close the performance gap in unnormalized ResNets Paper • 2101.08692 • Published Jan 21, 2021 • 2
When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations Paper • 2106.01548 • Published Jun 3, 2021 • 2
ResNet strikes back: An improved training procedure in timm Paper • 2110.00476 • Published Oct 1, 2021 • 2
A ResNet is All You Need? Modeling A Strong Baseline for Detecting Referable Diabetic Retinopathy in Fundus Images Paper • 2210.03180 • Published Oct 6, 2022
Revisiting ResNets: Improved Training and Scaling Strategies Paper • 2103.07579 • Published Mar 13, 2021 • 2
Aggregated Residual Transformations for Deep Neural Networks Paper • 1611.05431 • Published Nov 16, 2016 • 2
RTSeg: Real-time Semantic Segmentation Comparative Study Paper • 1803.02758 • Published Mar 7, 2018 • 2
Latent Diffusion Model for Medical Image Standardization and Enhancement Paper • 2310.05237 • Published Oct 8, 2023 • 2
3D Medical Image Segmentation based on multi-scale MPU-Net Paper • 2307.05799 • Published Jul 11, 2023 • 2
Joint Liver and Hepatic Lesion Segmentation in MRI using a Hybrid CNN with Transformer Layers Paper • 2201.10981 • Published Jan 26, 2022 • 2
Bootstrap your own latent: A new approach to self-supervised Learning Paper • 2006.07733 • Published Jun 13, 2020 • 2
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in Histopathology Paper • 2204.05044 • Published Apr 11, 2022 • 2
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology Paper • 2203.00585 • Published Mar 1, 2022 • 2
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks Paper • 1905.11946 • Published May 28, 2019 • 3
DAS: A Deformable Attention to Capture Salient Information in CNNs Paper • 2311.12091 • Published Nov 20, 2023 • 2
Semi-Supervised Semantic Segmentation using Redesigned Self-Training for White Blood Cells Paper • 2401.07278 • Published Jan 14, 2024 • 2
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 45
Data Distributional Properties Drive Emergent In-Context Learning in Transformers Paper • 2205.05055 • Published Apr 22, 2022 • 2
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents Paper • 2004.12629 • Published Apr 27, 2020 • 2
Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT Paper • 2401.03302 • Published Jan 6, 2024 • 1
Detecting and recognizing characters in Greek papyri with YOLOv8, DeiT and SimCLR Paper • 2401.12513 • Published Jan 23, 2024 • 1
DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets Paper • 2404.02900 • Published Apr 3, 2024 • 1
Transferable and Principled Efficiency for Open-Vocabulary Segmentation Paper • 2404.07448 • Published Apr 11, 2024 • 12
ConsistencyDet: Robust Object Detector with Denoising Paradigm of Consistency Model Paper • 2404.07773 • Published Apr 11, 2024 • 1
What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation Paper • 2404.07129 • Published Apr 10, 2024 • 3
Multiplication-Free Transformer Training via Piecewise Affine Operations Paper • 2305.17190 • Published May 26, 2023 • 2
Large Scale GAN Training for High Fidelity Natural Image Synthesis Paper • 1809.11096 • Published Sep 28, 2018 • 1
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era Paper • 1707.02968 • Published Jul 10, 2017 • 1
Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space Paper • 2406.19370 • Published Jun 27, 2024 • 1
Fixup Initialization: Residual Learning Without Normalization Paper • 1901.09321 • Published Jan 27, 2019 • 1
RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness Paper • 2206.14502 • Published Jun 29, 2022 • 1
RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer Paper • 2407.17140 • Published Jul 24, 2024 • 1
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published Dec 16, 2024 • 41