Orpheus Bangla (16 bit)

Model Description

This model is a proof-of-concept fine-tuned version of the Orpheus 3B TTS (Text-to-Speech) model for Bengali language support. The model has been trained using the SUST-CSE-Speech/banspeech dataset, which contains 955 audio samples split from audiobooks. This fine-tuning was performed for 10 epochs on a single Google Colab instance equipped with a T4 GPU.

Please note that this model is currently in the proof-of-concept phase and is not recommended for production use.

Intended Use

This model can be used for generating Bengali speech from text. It is ideal for experimenting with TTS systems for Bengali, particularly for audiobooks, conversational AI, or speech synthesis tasks.

Model Training

  • Dataset: SUST-CSE-Speech/banspeech (955 audiobook audio samples)
  • Training Epochs: 10 epochs
  • Hardware: Google Colab (single T4 GPU)
  • Training Script: A modified Unsloth fine-tuning script was used for the training. The script is available on GitHub: Orpheus TTS Training Script.

Limitations

  • This model was trained on a small dataset and for a limited number of epochs, which may lead to less natural or less accurate speech synthesis.
  • Since this is a proof-of-concept model, the synthesis quality may vary based on input text and different conditions. It is not optimized for production environments.

Model Usage


Training Resources:

Downloads last month
3
Safetensors
Model size
3.3B params
Tensor type
FP16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for asif00/orpheus-bangla-tts

Finetuned
(7)
this model
Quantizations
1 model

Dataset used to train asif00/orpheus-bangla-tts

Collection including asif00/orpheus-bangla-tts