Update README.md
Browse files
README.md
CHANGED
@@ -27,7 +27,7 @@ This model is developed for **cross-modal retrieval**, **few-shot classification
|
|
27 |
|
28 |
| Component | Architecture |
|
29 |
|------------------|-------------------------------------------|
|
30 |
-
| Image Encoder | Swin
|
31 |
| Text Encoder | RoBERTa-base |
|
32 |
| Projection Head | Linear layer (to 512-D space) |
|
33 |
| Normalization | L2 on both embeddings |
|
|
|
27 |
|
28 |
| Component | Architecture |
|
29 |
|------------------|-------------------------------------------|
|
30 |
+
| Image Encoder | Swin Tiny (patch4, window7, 224 resolution) |
|
31 |
| Text Encoder | RoBERTa-base |
|
32 |
| Projection Head | Linear layer (to 512-D space) |
|
33 |
| Normalization | L2 on both embeddings |
|