Open-source speech datasets annotated using Data-Speech
Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours.
 Viewer • Updated • 10.8M • 4.75k • 29- Note The English version of the Multilingual LibriSpeech (MLS) dataset. 
 - parler-tts/libritts_r_filteredViewer • Updated • 359k • 801 • 20- Note Filtered version of the 1K high-quality LibriTTS-R dataset. 
 - parler-tts/mls-eng-speaker-descriptionsViewer • Updated • 10.8M • 208 • 10- Note Annotations of English MLS above. Used for v1 training. 
 - parler-tts/libritts-r-filtered-speaker-descriptionsViewer • Updated • 359k • 110 • 7- Note Annotations of the filtered LibriTTS-R dataset. Used for v1 training. 
- 
	
	
	826Parler-TTS🥖High-fidelity Text-To-Speech 
- 
	
	
	Natural language guidance of high-fidelity text-to-speech with synthetic annotationsPaper • 2402.01912 • Published • 12
 - mythicinfinity/libritts_rViewer • Updated • 756k • 1.06k • 30- Note A 1K hours high-quality English speech dataset. 
 - parler-tts/mls_eng_10kViewer • Updated • 2.43M • 996 • 29- Note A 10K hours subset of the English version of the Multilingual LibriSpeech (MLS) dataset. 
 - parler-tts/mls-eng-10k-tags_tagged_10k_generatedViewer • Updated • 2.43M • 17 • 17- Note Annotations of the 10K hours subset of English MLS above. Used for v0.1 training. 
 - parler-tts/libritts_r_tags_tagged_10k_generatedViewer • Updated • 365k • 23 • 8- Note An annotated version of LibriTTS-R above. Used for v0.1 training. 
   - parler-tts/parler_tts_mini_v0.1Text-to-Speech • 0.6B • Updated • 3.91k • 357- Note A first model iteration of Parler-TTS, trained using the 10k hours of narrated audiobooks above. 
 
					 
					