cminst
/

StreamMamba

Video Classification

Model card Files Files and versions

qingy2024 commited on Jul 14

Commit

5b2c2d4

·

verified ·

1 Parent(s): 5a610af

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ This repository hosts pre-trained model checkpoints for cross-modal video-text u
 | Filename                | Size    | Description                                                                 |
 |-------------------------|---------|-----------------------------------------------------------------------------|
-| cross_mamba_film_ckpt.pt | 504 MB  | A cross-modal checkpoint combining vision and text using **FiLM** (Feature-wise Linear Modulation) and **Mamba** layers |
 | internvideo2_clip.pt    | 5.55 MB | CLIP component of **InternVideo2-B14** |
 | internvideo2_vision.pt  | 205 MB  | Vision encoder backbone for **InternVideo2-B14** |
 | mobileclip_blt.pt       | 599 MB  | Lightweight **MobileCLIP** variant (BLT) |

 | Filename                | Size    | Description                                                                 |
 |-------------------------|---------|-----------------------------------------------------------------------------|
+| cross_mamba_film_warmup.pt | 504 MB  | A cross-modal checkpoint combining vision and text using **FiLM** (Feature-wise Linear Modulation) and **Mamba** layers |
 | internvideo2_clip.pt    | 5.55 MB | CLIP component of **InternVideo2-B14** |
 | internvideo2_vision.pt  | 205 MB  | Vision encoder backbone for **InternVideo2-B14** |
 | mobileclip_blt.pt       | 599 MB  | Lightweight **MobileCLIP** variant (BLT) |