Text-to-Speech

Model Card for f5-tts-hakka-finetune

Model Details

F5-TTS finetune on all formosan data (ithuan, fb ilrdf dict, klokah) without samples only one word or no translation, using ipa as input.
g2p from this repo.

Training Details

  • learning rate: 0.00001
  • batch size per gpu: 9501
  • batch size type: frame
  • max samples: 64
  • grad accumulation steps: 1
  • max grad norm: 1
  • epochs: 158
  • num warmup updates: 20315

Model Sources

Uses

please refer source repo

Demo

https://huggingface.co/spaces/ithuan/formosan-f5-tts

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ithuan/f5-tts-formosan-all-finetune-v2

Base model

SWivid/F5-TTS
Finetuned
(56)
this model

Spaces using ithuan/f5-tts-formosan-all-finetune-v2 2