VideoRefer x VideoLLaMA3
Create and enrich datasets using AI
Generate audio from text with adjustable speed