Transformers
GGUF
English
qwen_vl
video
real-time
multimodal
LLM
conversational