Train models on images and text
Analyze images to identify and tag content
Generate music from text and melody descriptions