Generate speech from text, extract text from images, repair image regions
Identify and segment objects in images
Wait for task completion
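
Waiting for task completion is commonly done by polling a status check until the task reaches a final state. The sketch below is illustrative only: the source does not describe the actual interface, so `wait_for_task`, the `get_status` callable, and the status strings `"completed"` and `"failed"` are all hypothetical names chosen for the example.

```python
import time
from typing import Callable


def wait_for_task(
    get_status: Callable[[], str],
    interval: float = 1.0,
    timeout: float = 60.0,
) -> str:
    """Poll get_status() until it reports a final state or the timeout expires.

    get_status is a hypothetical callable returning the task's current status
    as a string; "completed" and "failed" are assumed to be the final states.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = get_status()
        if status in ("completed", "failed"):
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task still '{status}' after {timeout} seconds")
        # Pause between checks so the status source is not queried in a tight loop.
        time.sleep(interval)
```

A caller would pass any zero-argument function that reports the task's state, for example a lambda wrapping a status request. A fixed polling interval keeps the sketch short; a real client might prefer exponential backoff or a push notification if the service offers one.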