A Step Towards Music Generation Foundation Model
Generate images from text prompts
Ask an LLM about Arxiv papers