FastVLM: Efficient Vision Encoding for Vision Language Models
Paper
•
2412.13303
•
Published
•
48
Efficient Vision Encoding for Vision Language Models
Real-time video captioning powered by FastVLM
Note MLX checkpoint
Note MLX checkpoint
Note MLX checkpoint