# CUDA 11.8 wheel index for torch/torchvision (searched in addition to PyPI)
--extra-index-url https://download.pytorch.org/whl/cu118
# basic dependencies
torch==2.2.0
torchvision==0.17.0
transformers==4.40.0
tokenizers==0.19.1
deepspeed==0.13.1
accelerate==0.26.1
peft==0.4.0
timm==1.0.3
numpy==1.24.4
# data processing
decord==0.6.0
imageio==2.34.0
imageio-ffmpeg==0.4.9
moviepy==1.0.3
opencv-python==4.6.0.66
pysubs2
# misc
scikit-learn==1.2.2
huggingface_hub==0.23.4
diffusers==0.28.1
sentencepiece==0.1.99
shortuuid
einops==0.6.1
einops-exts==0.0.4
bitsandbytes==0.43.0
pydantic>=2.0
markdown2[all]
gradio==3.50.0
gradio_client==0.6.1
httpx==0.24.1
requests
openai
uvicorn
fastapi
tensorboard
wandb
tabulate
spaces==0.29.2