MrDragonFox's picture

MrDragonFox PRO

MrDragonFox

AI & ML interests

llm + audio i/o, (un)alignment

Recent Activity

updated a collection about 18 hours ago
full wordtimestamps (nc)
updated a collection about 18 hours ago
full wordtimestamps (nc)
updated a collection about 18 hours ago
full wordtimestamps (nc)
View all activity

Organizations

DeepGHS's profile picture Blog-explorers's profile picture SynthoCraft Ai's profile picture FoxEngineAi's profile picture Social Post Explorers's profile picture Mistral AI Game Jam's profile picture

Posts 2

view post
Post
2402
yet a other audio datasets pre classified for events + audio aestetics

this time for german - 680h sampled from emilia yodas

timestamps for asr training or other fancier things available as nc in the raw repo

MrDragonFox/DE_Emilia_Yodas_680h

cc by 4.0 as by emilia yodas

raw events / transcriptions are cc by NC 4.0

MrDragonFox/DE_Emilia_Yodas_680h_raw_timestamps

the coming days i should push about 600h english + some japanese too same format
view post
Post
1912
did a small emotive classified test dataset for all the tts tuners out there

MrDragonFox/Elise

3h total mit - single speaker voice

dataset is a copy of an existing one just added the emotional tags over 1200 samples - should be good enough to test if emotional tags stick in your finetune