
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
β’
2B
β’
Updated
β’
120k
β’
519
Ask questions about images to get detailed answers
A community project to create an image preferences dataset.
Generate clickable coordinates on a screenshot