Analyze document layout from images
Describe images using multiple models
Extract tables from images and convert to CSV
Generate depth map from images
Ask questions about images and get answers
Read text from images
Compare two images with a sentence
Classify images to find their most likely categories