bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated 5 days ago • 52.2k • 342
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated May 1 • 288k • 1.46k
moonshotai/Kimi-VL-A3B-Thinking-2506 • Image-Text-to-Text • 16B • Updated about 1 month ago • 41.1k • 236
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training • Paper • 2506.05301 • Published Jun 5 • 55