metadata
base_model:
- canopylabs/orpheus-3b-0.1-pretrained
tags:
- transformers
- llama
- gguf
- text-to-speech
license: apache-2.0
language:
- bn
datasets:
- SUST-CSE-Speech/banspeech
pipeline_tag: text-to-speech
Orpheus Bangla GGUF (16 bit)
Model Description
This model is a proof-of-concept fine-tune of the Orpheus 3B TTS (text-to-speech) model for Bengali. It was trained on the SUST-CSE-Speech/banspeech dataset, which contains 955 audio samples split from audiobooks. Fine-tuning ran for 10 epochs on a single Google Colab instance with a T4 GPU.
Please note that this model is currently in the proof-of-concept phase and is not recommended for production use.
Intended Use
This model generates Bengali speech from text. It is intended for experimenting with Bengali TTS, particularly for audiobooks, conversational AI, and other speech synthesis tasks.
Model Training
- Dataset: SUST-CSE-Speech/banspeech (955 audiobook audio samples)
- Training Epochs: 10
- Hardware: Google Colab (single T4 GPU)
- Training Script: A modified Unsloth fine-tuning script was used for training. The script is available on GitHub: Orpheus TTS Training Script.
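To give a feel for how small this training run is, the sketch below estimates the number of optimizer updates from the figures listed above. The sample count (955) and epoch count (10) come from this card; the batch size and gradient accumulation values are assumptions chosen for illustration only, and `estimate_optimizer_steps` is a hypothetical helper, not part of the training script. The result is a rough estimate and may differ from what a given trainer reports.

```python
import math


def estimate_optimizer_steps(num_samples, epochs, per_device_batch_size, grad_accum_steps):
    """Roughly estimate optimizer updates for a single-GPU fine-tuning run."""
    # Samples consumed per optimizer update on one GPU.
    effective_batch = per_device_batch_size * grad_accum_steps
    # A final partial batch still counts as one update in this estimate.
    steps_per_epoch = math.ceil(num_samples / effective_batch)
    return steps_per_epoch * epochs


# 955 samples and 10 epochs are taken from the model card.
# Batch size 1 and 4 accumulation steps are ASSUMED values, not the real settings.
total_steps = estimate_optimizer_steps(
    num_samples=955,
    epochs=10,
    per_device_batch_size=1,
    grad_accum_steps=4,
)
print(total_steps)
```

Under these assumed settings the run amounts to a few thousand updates at most, which fits the proof-of-concept framing of this card.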
Limitations
- This model was trained on a small dataset and for a limited number of epochs, which may lead to less natural or less accurate speech synthesis.
- Since this is a proof-of-concept model, synthesis quality may vary with the input text and other conditions. It is not optimized for production environments.
Model Usage