Gemini 2.0 native image generation co-doodling
Generate edited images with prompts
VGGT (CVPR 2025)
Detect objects in images or videos
Wan: Open and Advanced Large-Scale Video Generative Models
A Generalist Diffusion Model for Vision Perception
Select and display code snippets for AI providers
Text-to-3D and Image-to-3D Generation
Scalable and Versatile 3D Generation from images
https://huggingface.co/papers/2501.03006
Extend images using prompts and alignment options
Convert images to 3D depth maps
Generate modified images from prompts with styles
create games with AI
Quickly edit the expression of a face
Generate and edit audio from text prompts
High-fidelity Virtual Try-on
Overlay garment on person image