Generate music from text prompts using Facebook's MusicGen small model
Generate high-quality images from text prompts
Generate images from text prompts using the FLUX.1-dev model
Summarize text input