Image2Caption, useful for dataset making and AI agents.
Create videos with FFMPEG + Qwen2.5-Coder
Versatile audio super resolution (any -> 48kHz) with AudioSR
YouTube downloader featuring 4k support built with gradio.
A Gradio app using MusicGen for music enhancement.