Mwangi PRO

Benson

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

kahrendt/microwakeword

liked a dataset 7 days ago

SparkAudio/voxbox

upvoted an article 7 days ago

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

View all activity

Organizations

None yet

liked a dataset 3 days ago

kahrendt/microwakeword

Updated Nov 22, 2024 • 495 • 4

liked a dataset 7 days ago

SparkAudio/voxbox

Viewer • Updated Apr 15 • 23.8M • 15.1k • 56

upvoted 2 articles 7 days ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Nov 21

•

Article

LLM based Audio models

12 days ago

•

liked a model 7 days ago

YatharthS/MiraTTS

Text-to-Speech • 0.5B • Updated 6 days ago • 3.63k • 153

liked a model 10 days ago

google/medasr

Automatic Speech Recognition • Updated 8 days ago • 4.07k • 186

liked a dataset 13 days ago

kolerk/Video_Reality_Test

Viewer • Updated 7 days ago • 149 • 1.45k • 6

upvoted a paper 13 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 15 days ago • 63

liked a model 14 days ago

meituan-longcat/LongCat-Video-Avatar

Updated 13 days ago • 785 • 187

updated a collection 15 days ago

interestingai

Collection

7 items • Updated 15 days ago

upvoted a paper 15 days ago

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

Paper • 2506.12509 • Published Jun 14 • 2

liked a model 19 days ago

zai-org/RealVideo

Any-to-Any • Updated 19 days ago • 94

liked a dataset 20 days ago

kunli-cs/MMA-52

Viewer • Updated Jul 25 • 6.53k • 58 • 3

liked a Space 20 days ago

Qwen3 Omni Demo

⚡

226

Generate audio responses from text and media inputs

upvoted a paper 21 days ago

Scaling Zero-Shot Reference-to-Video Generation

Paper • 2512.06905 • Published 23 days ago • 28

upvoted a paper 22 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 26 days ago • 167

liked a dataset 22 days ago

sarulab-speech/yodas2_sidon

Viewer • Updated 24 days ago • 1.46M • 20.5k • 51

upvoted a paper 23 days ago

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published 28 days ago • 64

liked 2 models 23 days ago

meituan-longcat/LongCat-Image-Edit

Image-to-Image • Updated 14 days ago • 58.6k • • 146

meituan-longcat/LongCat-Image

Text-to-Image • Updated 14 days ago • 61.3k • • 225

Mwangi PRO

AI & ML interests

Recent Activity

Organizations

Benson's activity

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

LLM based Audio models

Qwen3 Omni Demo