nvidia/MambaVision-B-1K

Image Classification


This repository contains the data for the paper PAVE: Patching and Adapting Video Large Language Models.

Code: https://github.com/dragonlzm/PAVE

Citation

Paper: https://arxiv.org/abs/2503.19794

BibTeX:

@misc{liu2025pavepatchingadaptingvideo,
      title={PAVE: Patching and Adapting Video Large Language Models}, 
      author={Zhuoming Liu and Yiquan Li and Khoi Duc Nguyen and Yiwu Zhong and Yin Li},
      year={2025},
      eprint={2503.19794},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.19794}, 
}


Safetensors

Model size: 97.7M params

Tensor type: F32
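As a rough sanity check on the figures above, the parameter count and tensor type together give an approximate size for the raw weights. The sketch below is only illustrative: the helper name `estimate_weight_bytes` and the `BYTES_PER_DTYPE` table are hypothetical, and the estimate ignores file headers and any metadata stored alongside the tensors.

```python
# Bytes per element for a few common tensor dtypes (standard widths).
# This table is an illustrative assumption, not part of the model card.
BYTES_PER_DTYPE = {"F32": 4, "F16": 2, "BF16": 2}


def estimate_weight_bytes(num_params: float, dtype: str = "F32") -> int:
    """Approximate size of the raw weights in bytes (hypothetical helper).

    Ignores file headers and metadata, so a real checkpoint file will
    differ slightly from this number.
    """
    return int(round(num_params * BYTES_PER_DTYPE[dtype]))


if __name__ == "__main__":
    # 97.7M parameters stored as F32, as listed on this page.
    size = estimate_weight_bytes(97.7e6, "F32")
    print(f"{size / 1e6:.1f} MB")  # 390.8 MB
```

Under these assumptions, storing the same weights in a 16-bit type would take roughly half that space.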


Collection including nvidia/MambaVision-B-1K

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models.