Shivam Mehta

shivammehta25
http://www.shivammehta.me
  • shivammehta007
  • shivammehta007
  • shivammehta25

AI & ML interests

Speech, Audio, LLM, Flow Matching, Diffusion, Flows, HMMs

Organizations

  • KTH
  • Merge Crew
  • Hugging Face Discord Community
  • Speech, Music and Hearing (TMH)

authored a paper over 1 year ago

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis

Paper • 2404.19622 • Published Apr 30, 2024 • 2
authored 6 papers almost 2 years ago

Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation

Paper • 2309.05455 • Published Sep 11, 2023

Prosody-controllable spontaneous TTS with neural HMMs

Paper • 2211.13533 • Published Nov 24, 2022

OverFlow: Putting flows on top of neural transducers for better TTS

Paper • 2211.06892 • Published Nov 13, 2022

Neural HMMs are all you need (for high-quality attention-free TTS)

Paper • 2108.13320 • Published Aug 30, 2021

Matcha-TTS: A fast TTS architecture with conditional flow matching

Paper • 2309.03199 • Published Sep 6, 2023 • 12

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Paper • 2306.09417 • Published Jun 15, 2023 • 3