birder-project/mvit_v2_t_il-all

Image Classification

hassonofer committed 28 days ago

Commit d88de45 (verified) · 1 parent: 0df9b05

Update README.md

Files changed (1)

README.md +6 -6

@@ -8,7 +8,7 @@ license: apache-2.0
 # Model Card for mvit_v2_t_il-all
-MViTv2 image classification model. This model was trained on the `il-all` dataset (all the relevant bird species found in Israel inc. rarities).
 The species list is derived from data available at <https://www.israbirding.com/checklist/>.
@@ -16,12 +16,12 @@ The species list is derived from data available at <https://www.israbirding.com/
 - **Model Type:** Image classification and detection backbone
 - **Model Stats:**
-  - Params (M): 23.9
-  - Input image size: 384 x 384
 - **Dataset:** il-all (550 classes)
 - **Papers:**
-  - MViTv2: Improved Multiscale Vision Transformers for Classification and Detection: <https://arxiv.org/abs/2112.01526>
 ## Model Usage
@@ -39,9 +39,9 @@ size = birder.get_size_from_signature(signature)
 # Create an inference transform
 transform = birder.classification_transform(size, rgb_stats)
-image = "path/to/image.jpeg"  # or a PIL image
 (out, _) = infer_image(net, image, transform)
-# out is a NumPy array with shape of (1, num_classes)
 ```
 ### Image Embeddings

 # Model Card for mvit_v2_t_il-all
+An MViTv2 image classification model. This model was trained on the `il-all` dataset, encompassing all relevant bird species found in Israel, including rarities.
 The species list is derived from data available at <https://www.israbirding.com/checklist/>.
 - **Model Type:** Image classification and detection backbone
 - **Model Stats:**
+    - Params (M): 23.9
+    - Input image size: 384 x 384
 - **Dataset:** il-all (550 classes)
 - **Papers:**
+    - MViTv2: Improved Multiscale Vision Transformers for Classification and Detection: <https://arxiv.org/abs/2112.01526>
 ## Model Usage
 # Create an inference transform
 transform = birder.classification_transform(size, rgb_stats)
+image = "path/to/image.jpeg"  # or a PIL image, must be loaded in RGB format
 (out, _) = infer_image(net, image, transform)
+# out is a NumPy array with shape of (1, num_classes), representing class probabilities.
 ```
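
Because the updated comment describes `out` as class probabilities with shape `(1, num_classes)`, a reader will usually want the highest-scoring classes next. The following is a minimal sketch of that step using only NumPy. The `top_k` helper and the four-class sample array are hypothetical stand-ins for illustration and are not part of the birder API; mapping indices back to species names depends on the model's class list, which is not shown in this diff.

```python
import numpy as np


def top_k(out: np.ndarray, k: int = 5) -> list[tuple[int, float]]:
    """Return (class_index, probability) pairs for the k highest-scoring classes.

    Hypothetical helper: assumes `out` has shape (1, num_classes).
    """
    probs = out[0]  # drop the batch dimension: (1, num_classes) -> (num_classes,)
    k = min(k, probs.shape[0])  # never ask for more classes than exist
    idx = np.argsort(probs)[::-1][:k]  # indices sorted by descending probability
    return [(int(i), float(probs[i])) for i in idx]


# Stand-in for the array returned by infer_image (made-up values, 4 classes)
out = np.array([[0.1, 0.6, 0.05, 0.25]])
print(top_k(out, k=2))  # [(1, 0.6), (3, 0.25)]
```

With the real model, `out` would have 550 columns, one per class in `il-all`.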
 ### Image Embeddings