Update README.md
Browse files
README.md
CHANGED
@@ -32,12 +32,12 @@ This model is licensed under the <a href="https://www.apache.org/licenses/LICENS
|
|
32 |
| Filename | Size | Description |
|
33 |
|-------------------------|----------|-----------------------------------------------------------------------------|
|
34 |
| `cross_mamba_film_warmup.pt` | 504 MB | Cross-modal model combining vision and text using **FiLM** (Feature-wise Linear Modulation) and **Mamba** layers for temporal modeling. |
|
35 |
-
| `mamba_mobileclip_ckpt.pt` | 500 MB |
|
36 |
| `internvideo2_clip.pt` | 5.55 MB | CLIP-style vision-language alignment component for InternVideo2-B14. |
|
37 |
| `internvideo2_vision.pt` | 205 MB | Vision encoder backbone (InternVideo2-B14) for video feature extraction. |
|
38 |
| `mobileclip_blt.pt` | 599 MB | Lightweight **MobileCLIP** variant (BLT) for resource-constrained applications. |
|
39 |
|
40 |
-
#### Self-Predictive Frame Skipping (SPFS)
|
41 |
The `spfs_r64` folder contains a self-contained system for adaptive frame skipping in videos. Each checkpoint file includes:
|
42 |
- MobileCLIP vision/text encoders
|
43 |
- InternVideo2-B14 vision encoder weights
|
|
|
32 |
| Filename | Size | Description |
|
33 |
|-------------------------|----------|-----------------------------------------------------------------------------|
|
34 |
| `cross_mamba_film_warmup.pt` | 504 MB | Cross-modal model combining vision and text using **FiLM** (Feature-wise Linear Modulation) and **Mamba** layers for temporal modeling. |
|
35 |
+
| `mamba_mobileclip_ckpt.pt` | 500 MB | <span style="position: relative; cursor: help;"><span class="streammamba-glow">StreamMamba</span><span class="glow-ring"></span></span> temporal aggregator trained on MobileCLIP embeddings (no FiLM). Checkpoint 6900. |
|
36 |
| `internvideo2_clip.pt` | 5.55 MB | CLIP-style vision-language alignment component for InternVideo2-B14. |
|
37 |
| `internvideo2_vision.pt` | 205 MB | Vision encoder backbone (InternVideo2-B14) for video feature extraction. |
|
38 |
| `mobileclip_blt.pt` | 599 MB | Lightweight **MobileCLIP** variant (BLT) for resource-constrained applications. |
|
39 |
|
40 |
+
#### <span style="position: relative; cursor: help;"><span class="streammamba-glow">StreamMamba</span><span class="glow-ring"></span></span> Self-Predictive Frame Skipping (SPFS)
|
41 |
The `spfs_r64` folder contains a self-contained system for adaptive frame skipping in videos. Each checkpoint file includes:
|
42 |
- MobileCLIP vision/text encoders
|
43 |
- InternVideo2-B14 vision encoder weights
|