Visual models: Qwen2-VL-72B (Running, 621). Engage in multi-modal conversations with images and videos.