cminst/StreamMamba

Video Classification


qingy2024 committed on Jul 12

Commit 91bc6c7 · verified · 1 Parent(s): 408c7d7

Update README.md

Files changed (1)

README.md +1 -1

@@ -10,7 +10,7 @@ This repository hosts pre-trained model checkpoints for cross-modal video-text u
 | Filename                | Size    | Description                                                                 |
 |-------------------------|---------|-----------------------------------------------------------------------------|
-| cross_mamba_film_ckpt.pt | 504 MB  | A cross-modal checkpoint combining vision and text using **FiLM** (Feature-wise Linear Modulation) layers, optimized for Mamba architecture. |
+| cross_mamba_film_ckpt.pt | 504 MB  | A cross-modal checkpoint combining vision and text using **FiLM** (Feature-wise Linear Modulation) and **Mamba** layers |
 | internvideo2_clip.pt    | 5.55 MB | CLIP component of **InternVideo2-B14** |
 | internvideo2_vision.pt  | 205 MB  | Vision encoder backbone for **InternVideo2-B14** |
 | mobileclip_blt.pt       | 599 MB  | Lightweight **MobileCLIP** variant (BLT) |