Qwen3 ASR Demo
Convert audio to text with context and language options
Convert audio to text with context and language options
Generate high-quality images from text prompts
Generate images from text prompts
inpaint images using Qwen Image with inpainting Controlnet
UMO based on OmniGen2
Dedicated display for RTEB benchmark results
Flux Kontext extended with product placement capabilities
Generate 3D CAD models from images
Generate any application with DeepSeek
generate a video from an image with a text prompt
Generate expressive speech from text with emotion control
Generate a video by interpolating between two images with a prompt
Powerful Watermark Removal API
Convert images to structured documents and answer questions
Generate high-quality images from text prompts
Convert audio to text with context and language options
Generate web application code from descriptions
Try on clothes virtually by uploading images
Remove background from images
Swap faces in images
Generate images from text prompts
High-fidelity 3D Geometry Generation from single view image
Edit images based on user instructions
Embedding Leaderboard
The ultimate guide to training LLM on large GPU Clusters
Image-to-3D Generation
generate a video from an image with a text prompt
ChatGPT with real-time web search & URL reading capability
Generate Gradio app code from user requests
Clarity AI Upscaler Reproduction
Generate 3D CAD models from images
VoxCPM