Identify and segment objects in images
Generate realistic dialogue from a script, using Dia!
Describe images by asking questions about them
Transcribe audio and identify background sounds