Spaces: Matthijs/speecht5-tts-demo

trying to hack together a voice cloning demo....

by sherlock1199 - opened Feb 10, 2023

sherlock1199

Feb 10, 2023

I've been trying to create my own custom speaker embeddings using speechbrain/spkrec-xvect-voxceleb:

signal, fs = torchaudio.load('morgan.wav')
embeddings = classifier.encode_batch(signal)

and generating audio using:

speech = model.generate_speech(inputs["input_ids"], embeddings[0], vocoder=vocoder)

but the output is garbled. Is there an intermediary step I'm missing?

sherlock1199

Feb 10, 2023

So I managed to get a non-garbled output after resampling my WAV file to 16 kHz and converting it to mono. Now to figure out how to improve the quality of the voice reproduction.

NS-Y

Feb 17, 2023

Great work! Where can I find information about fine-tuning to other languages?
