Finalized model with better hyperparameters and also including Determined AI checkpoint.pt

Files changed (9)

README.md +56 -22
image.png → images/image.png +0 -0
images/test_accuracy.png +0 -0
images/test_auc.png +0 -0
images/test_loss.png +0 -0
model/Determined_AI_checkpoint.pt +3 -0
model/Determined_AI_metadata.json +4 -0
model/config.json +10 -0
pytorch_model.bin → model/pytorch_model.bin +2 -2

README.md CHANGED

@@ -8,7 +8,9 @@ tags:
 datasets:
 - medmnist
 metrics:
 - accuracy
 ---
 # MedMNIST Active Learning Model
@@ -27,19 +29,65 @@ This model is designed for image classification tasks within the medical imaging
 ## Training Procedure
-- **Dataset:** [PathMNIST](https://medmnist.com/)
-- **Data Augmentation:**
   - Random resized cropping
   - Horizontal flipping
   - Random rotations
   - Color jittering
   - Gaussian blur
   - RandAugment
-- **Optimizer:** Stochastic Gradient Descent (SGD) with momentum
-- **Learning Rate Scheduler:** ReduceLROnPlateau
-- **Active Learning Strategy:** Mixed sampling combining uncertainty sampling and diversity sampling using Monte Carlo dropout and K-means clustering.
 ## Usage
 To utilize this model:
@@ -84,27 +132,13 @@ To utilize this model:
     print(f"Predicted class: {prediction}")
     ```
-## Evaluation
-The model was evaluated on the validation set of PathMNIST. Key performance metrics include:
-- **Accuracy:** 94%
-- **Loss:** 0.1775
-## Evaluation Metrics
-The following plot illustrates the validation loss over training batches during the active learning process. The consistent decrease in validation loss demonstrates the effectiveness of the active learning strategy in improving model performance.
-![Validation Loss](image.png)
-- **Validation Loss**: The graph shows a steady decline, indicating successful learning and convergence.
-- **Batches**: Represents the number of iterations over the dataset.
 ## License
-This project is licensed under the mit License.
 ## Acknowledgements
 - [MedMNIST Dataset](https://medmnist.com/)
 - [Determined AI](https://determined.ai/)

 datasets:
 - medmnist
 metrics:
+- loss
 - accuracy
+- area under the curve
 ---
 # MedMNIST Active Learning Model
 ## Training Procedure
+### Training Hyperparameters
+| Hyperparameter         | Value                  |
+|------------------------|------------------------|
+| Batch Size             | 53                    |
+| Initial Labeled Size   | 3559                  |
+| Learning Rate          | 0.01332344940133225    |
+| MC Dropout Passes      | 6                     |
+| Samples to Label       | 4430                  |
+| Weight Decay           | 0.00021921795989143406 |
+### Optimizer Settings
+The optimizer used during training was Stochastic Gradient Descent (SGD), with the following settings and a ReduceLROnPlateau learning rate scheduler:
+- `learning_rate = 0.01332344940133225`
+- `momentum = 0.9`
+- `weight_decay = 0.00021921795989143406`
+The model was trained with float32 precision.
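For reference, the settings above correspond to the update rule below. This is a sketch that assumes the run used the PyTorch `torch.optim.SGD` convention with zero dampening and no Nesterov momentum, neither of which the README states. The symbols are introduced here for illustration: $\theta$ for the parameters, $L$ for the loss, $b$ for the momentum buffer (with $b_1 = g_1$), and $\eta_t$ for the learning rate at step $t$.

```latex
\begin{aligned}
g_t      &= \nabla_\theta L(\theta_{t-1}) + \lambda\,\theta_{t-1}, & \lambda &= 0.00021921795989143406 \\
b_t      &= \mu\, b_{t-1} + g_t,                                   & \mu     &= 0.9 \\
\theta_t &= \theta_{t-1} - \eta_t\, b_t,                           & \eta_0  &= 0.01332344940133225
\end{aligned}
```

ReduceLROnPlateau keeps $\eta_t$ constant until the monitored metric stops improving, then multiplies it by a reduction factor. The README gives neither the factor nor the patience, so they are left unspecified here.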
+### Dataset
+[PathMNIST](https://medmnist.com/)
+### Data Augmentation
   - Random resized cropping
   - Horizontal flipping
   - Random rotations
   - Color jittering
   - Gaussian blur
   - RandAugment
+### Active Learning Strategy
+The active learning process was based on a mixed sampling strategy:
+- **Uncertainty Sampling**: Monte Carlo (MC) dropout was used to estimate uncertainty.
+- **Diversity Sampling**: K-means clustering was employed to ensure diverse samples.
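The two components above can be combined in several ways, and the README does not say how the repository does it. The following is a minimal sketch of one common arrangement, using only the standard library: rank unlabeled samples by the predictive entropy of their averaged MC dropout probabilities, cluster the most uncertain ones with a small k-means, and keep the most uncertain sample per cluster. The function names, the `pool_factor` parameter, and the choice of entropy as the uncertainty score are all assumptions made for illustration.

```python
import math
import random


def predictive_entropy(mc_probs):
    """Entropy of the class distribution averaged over MC dropout passes.

    mc_probs: list of probability vectors, one per stochastic forward pass.
    """
    n_passes = len(mc_probs)
    mean = [sum(col) / n_passes for col in zip(*mc_probs)]
    return -sum(p * math.log(p) for p in mean if p > 0.0)


def _sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_assign(points, k, iters=10, seed=0):
    """Plain Lloyd's k-means; returns one cluster index per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    assign = [0] * len(points)
    for _ in range(iters):
        assign = [min(range(k), key=lambda j: _sq_dist(p, centers[j])) for p in points]
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:  # leave an empty cluster's center where it was
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def mixed_sample(mc_probs, features, budget, pool_factor=3):
    """Pick at most `budget` indices that are both uncertain and spread out.

    mc_probs[i] holds the MC dropout outputs for sample i and features[i]
    its feature vector. pool_factor is a hypothetical knob that controls how
    many top-uncertainty candidates are passed on to clustering.
    """
    scores = [predictive_entropy(p) for p in mc_probs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    candidates = ranked[: budget * pool_factor]
    k = min(budget, len(candidates))
    assign = kmeans_assign([features[i] for i in candidates], k)
    chosen = {}
    # candidates are in descending uncertainty order, so the first index
    # seen for a cluster is that cluster's most uncertain member
    for idx, cluster in zip(candidates, assign):
        chosen.setdefault(cluster, idx)
    return sorted(chosen.values())
```

With the hyperparameters in the table, each `mc_probs[i]` would hold 6 probability vectors (MC Dropout Passes) and `budget` would be 4430 (Samples to Label). A pure-Python k-means is only suitable for illustration at that scale.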
+## Evaluation
+The model was evaluated on the validation set of PathMNIST. Key performance metrics include:
+- **Accuracy:** 94.72%
+- **Loss:** 0.2397
+- **AUC:** 99.73%
+## Graphs
+The following plots illustrate the validation loss, validation accuracy, and validation AUC over batches (the number of iterations over the dataset) during the active learning process.
+- **Validation Loss**
+![Validation Loss](images/test_loss.png)
+- **Validation Accuracy**
+![Validation Accuracy](images/test_accuracy.png)
+- **Validation AUC**
+![Validation AUC](images/test_auc.png)
 ## Usage
+All code for this model can be accessed in the following GitHub Repository:
+[Allen Cheung Determined_AI_Hackathon](https://github.com/AllenCheung0213/Determined_AI_Hackathon)
 To utilize this model:
     print(f"Predicted class: {prediction}")
     ```
 ## License
+This project is licensed under the MIT License.
 ## Acknowledgements
 - [MedMNIST Dataset](https://medmnist.com/)
 - [Determined AI](https://determined.ai/)
+- **Survey on Deep Active Learning**: Wang, H., Jin, Q., Li, S., Liu, S., Wang, M., & Song, Z. (2024). A comprehensive survey on deep active learning in medical image analysis. *Medical Image Analysis*, 95, 103201. [https://doi.org/10.1016/j.media.2024.103201](https://doi.org/10.1016/j.media.2024.103201)

image.png → images/image.png RENAMED

File without changes

images/test_accuracy.png ADDED

images/test_auc.png ADDED

images/test_loss.png ADDED

model/Determined_AI_checkpoint.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:90601111b45e812a9aa039a02d9bb133468216b348f3ff067ac211a0787e2e97
+size 188520735
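The three lines above are a Git LFS pointer, not the checkpoint itself: they record the SHA-256 digest and byte size of the real file. As a rough sketch, a downloaded copy could be checked against the pointer like this. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path

# Pointer contents copied from the hunk above, without the diff "+" markers.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:90601111b45e812a9aa039a02d9bb133468216b348f3ff067ac211a0787e2e97
size 188520735
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # read in chunks so a large checkpoint is never held in memory at once
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("model/Determined_AI_checkpoint.pt", parse_lfs_pointer(POINTER_TEXT))` would return `False` if the local file were still the small pointer stub rather than the resolved checkpoint.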

model/Determined_AI_metadata.json ADDED

	@@ -0,0 +1,4 @@

+{
+  "epochs_completed": 18,
+  "steps_completed": 54905
+}

model/config.json ADDED

	@@ -0,0 +1,10 @@

+{
+    "model_type": "resnet50",
+    "num_classes": 9,
+    "input_size": [
+        3,
+        28,
+        28
+    ],
+    "architecture": "ResNet50"
+}
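A short sketch of how a consumer might read this file follows. It assumes `input_size` is ordered as (channels, height, width), the usual PyTorch layout, which the config itself does not state. `load_model_config` is a hypothetical helper, not part of the repository.

```python
import json

# Same content as the added model/config.json, without the diff "+" markers.
CONFIG_TEXT = """
{
    "model_type": "resnet50",
    "num_classes": 9,
    "input_size": [3, 28, 28],
    "architecture": "ResNet50"
}
"""


def load_model_config(text):
    """Parse the config and derive the per-image tensor shape and element count."""
    config = json.loads(text)
    # assumption: input_size is (channels, height, width)
    channels, height, width = config["input_size"]
    if config["num_classes"] < 1:
        raise ValueError("num_classes must be positive")
    config["input_shape"] = (channels, height, width)
    config["values_per_image"] = channels * height * width
    return config


config = load_model_config(CONFIG_TEXT)
print(config["architecture"], config["num_classes"], config["input_shape"])
```

The derived shape can then be used to check that preprocessing yields tensors of the size the model expects before loading the weights.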

pytorch_model.bin → model/pytorch_model.bin RENAMED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2e047c032bcb68539b69c2ce93a52a7f72d5c45d9e512ee38f3f858f337dc210
-size 94393034

 version https://git-lfs.github.com/spec/v1
+oid sha256:db3f9ab941286c336727e5a0c4d4b35ff1b8db5b7f8519573600bd2ee0108ef7
+size 94397514