fepegar committed
Commit 31b1baf
1 parent: 453ea1e

Update README.md

Files changed (1)
  1. README.md +19 -11

README.md CHANGED
@@ -128,8 +128,6 @@ torch.Size([1, 768, 16, 16])
  <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

  We used images from five public, deidentified chest X-ray datasets to train this checkpoint of RAD-DINO.
- Images in the validation and test sets used to train [MAIRA](https://arxiv.org/abs/2311.13668) were excluded from the training set of RAD-DINO.
- The list of image files used for training is available at [`./training_images.csv`](./training_images.csv).

  | Dataset | Num. images |
  | --------- | ----------: |
@@ -139,7 +137,12 @@ The list of image files used for training is available at [`./training_images.cs
  | [PadChest](https://www.sciencedirect.com/science/article/abs/pii/S1361841520301614) | 136 787 |
  | [BRAX](https://www.nature.com/articles/s41597-022-01608-8) | 41 260 |

- Note this checkpoint is different from the one in the paper, where some private data was used.
+ Images in the validation and test sets used to train [MAIRA](https://arxiv.org/abs/2311.13668) were excluded from the training set of RAD-DINO.
+ The list of image files used for training is available at [`./training_images.csv`](./training_images.csv).
+
+ Note this checkpoint is different from the one in the paper, where some private data was used (and fewer GPUs).
+ The checkpoint shared here is trained for 35 000 iterations (the total number of iterations in the run was 100 000, but we selected this checkpoint using linear probing on the validation sets of the evaluation datasets described in the paper).
+ We used 16 nodes with 4 A100 GPUs each, and a batch size of 40 images per GPU.

  ### Training procedure

@@ -189,11 +192,16 @@ Our evaluation is best described in the [manuscript](https://arxiv.org/abs/2401.

  <!-- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). -->

- - **Hardware Type:** NVIDIA A100 GPUs
- - **Hours used:** 47 hours/GPU × 4 nodes × 4 GPUs/node = 752 hours
- - **Cloud Provider:** Azure
- - **Compute Region:** West US 2
- - **Carbon Emitted:** 65.2 kg CO₂ eq.
+ <!-- Hardware type: A100 PCIe -->
+ <!-- Hours: 1d 16h = 40h -->
+ <!-- Cloud provider: Azure -->
+ <!-- Region: Italy North -->
+
+ - **Hardware type:** NVIDIA A100 GPUs
+ - **Hours used:** 40 hours/GPU × 16 nodes × 4 GPUs/node = 2560 GPU-hours
+ - **Cloud provider:** Azure
+ - **Compute region:** West US 2
+ - **Carbon emitted:** 222 kg CO₂ eq.

  ### Compute infrastructure

@@ -201,7 +209,7 @@ RAD-DINO was trained on [Azure Machine Learning](https://azure.microsoft.com/en-

  #### Hardware

- We used four `Standard_NC96ads_A100_v4` nodes with four NVIDIA A100 (80 GB) GPUs each.
+ We used 16 `Standard_NC96ads_A100_v4` nodes with four NVIDIA A100 (80 GB) GPUs each.

  #### Software

@@ -216,12 +224,12 @@ We used [SimpleITK](https://simpleitk.org/) and [Pydicom](https://pydicom.github

  ```bibtex
  @article{PerezGarcia2024RADDINOES,
-   title={{RAD-DINO}: Exploring Scalable Medical Image Encoders Beyond Text Supervision},
+   title={RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision},
    author={Fernando Pérez-García and Harshita Sharma and Sam Bond-Taylor and Kenza Bouzid and Valentina Salvatelli and Maximilian Ilse and Shruthi Bannur and Daniel C. Castro and Anton Schwaighofer and Matthew P. Lungren and Maria Teodora Wetscherek and Noel Codella and Stephanie L. Hyland and Javier Alvarez-Valle and Ozan Oktay},
    journal={ArXiv},
    year={2024},
    volume={abs/2401.10815},
-   url={https://arxiv.org/abs/2401.10815}
+   url={https://api.semanticscholar.org/CorpusID:267060839}
  }
  ```
 
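The new card says the checkpoint was selected "using linear probing on the validation sets of the evaluation datasets" but does not spell out the probe. A linear probe conventionally means fitting a linear classifier on frozen encoder features; the sketch below is a minimal illustration of that idea with scikit-learn, where the random arrays are hypothetical stand-ins for RAD-DINO embeddings and finding labels (nothing here is the authors' code):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(seed=0)

# Hypothetical stand-ins: 768-dim frozen ViT-B embeddings and binary labels.
train_features = rng.normal(size=(1000, 768))
train_labels = rng.integers(0, 2, size=1000)
val_features = rng.normal(size=(200, 768))
val_labels = rng.integers(0, 2, size=200)

# Fit a linear classifier on the frozen features; the encoder is never updated.
probe = LogisticRegression(max_iter=1000).fit(train_features, train_labels)

# The probe's validation score is the kind of metric used to rank checkpoints.
print(probe.score(val_features, val_labels))
```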
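Finally, the context line in the first hunk, `torch.Size([1, 768, 16, 16])`, is the shape printed by the card's feature-extraction example. The sketch below shows how such a map is typically reassembled from a DINOv2-style ViT-B/14 with 🤗 Transformers; the repo id and the 224×224 input are assumptions for illustration (a ViT-B/14 turns a 224×224 image into a 16×16 grid of 768-dim patch tokens):

```python
import torch
from transformers import AutoModel

# Assumed repo id -- check the model card for the published identifier.
model = AutoModel.from_pretrained("microsoft/rad-dino")
model.eval()

pixel_values = torch.rand(1, 3, 224, 224)  # dummy preprocessed image batch
with torch.inference_mode():
    outputs = model(pixel_values=pixel_values)

tokens = outputs.last_hidden_state[:, 1:]  # drop the CLS token, keep patch tokens
batch, num_patches, dim = tokens.shape     # (1, 256, 768); (224 / 14) ** 2 = 256
side = int(num_patches**0.5)
feature_map = tokens.permute(0, 2, 1).reshape(batch, dim, side, side)
print(feature_map.shape)                   # torch.Size([1, 768, 16, 16])
```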