PuLID-FLUX
Generate images from text prompts and ID images
Generate images from text prompts and ID images
Generate 3D models from images
Fill in image areas using prompts and masks
Generate music from text descriptions
Upscale low-resolution images to high resolution
Personalised Podcasts For All - Available in 13 Languages
Import a portrait, click to move the head!
Efficient T2V generation
Co-Speech Gesture Video Generation
Generate images from text prompts
Generate music from text descriptions
8B parameter transformer model distilled from the FLUX.1-dev
Detect and estimate human poses in images and videos
Generate 3D models from images
Make Custom Voices With KokoroTTS
In-browser unified multimodal understanding and generation.
Generate music from lyrics and genre tags
Remove background from images and videos
Audio Gen, Audio Style Transfer and Audio InPainting
Generate responses using images and text input
Generate images from text descriptions
OmniParser, turn your LLM into GUI agent
A Generalist Diffusion Model for Vision Perception
Blazingly Fast and Embarrassingly Simple Song Generation
Large Avatar Model for One-shot Animatable Gaussian Head
Generate realistic dialogue from a script, using Dia!
ultra-fast video model, LTX 0.9.8 13B distilled
Demo for multimodal understanding and generation
Multimodal Instruction-based Editing and Generation
Fast 4 step inference with Qwen Image Edit 2509
Track and label objects in videos using text prompts or clicks
Focus and blur parts of an image interactively