Mark regions in images based on text descriptions
Chat with an AI that understands text and images
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Segment images using text prompts, points, or everything mode