metadata

base_model:
  - canopylabs/orpheus-3b-0.1-pretrained
tags:
  - transformers
  - llama
  - gguf
  - text-to-speech
license: apache-2.0
language:
  - bn
datasets:
  - SUST-CSE-Speech/banspeech
pipeline_tag: text-to-speech

Orpheus Bangla GGUF (16 bit)

Model Description

This model is a proof-of-concept fine-tune of the Orpheus 3B TTS (text-to-speech) model that adds Bengali support. It was trained on the SUST-CSE-Speech/banspeech dataset, which contains 955 audio samples split from audiobooks. Fine-tuning ran for 10 epochs on a single Google Colab instance with a T4 GPU.

Please note that this model is currently in the proof-of-concept phase and is not recommended for production use.
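As a rough guide to the memory the 16-bit weights need, the sketch below multiplies a parameter count by the bytes per weight. The figure of 3 billion parameters is an assumption taken from the "3B" in the base model name; the exact parameter count and the size of the published GGUF file are not stated in this card, and GGUF metadata and runtime buffers (such as the KV cache) add to the total.

```python
def weights_size_gib(n_params: int, bits_per_weight: int) -> float:
    """Approximate size of the raw weights in GiB (ignores metadata and KV cache)."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


# Assumption: a nominal 3 billion parameters, read off the "3B" model name.
NOMINAL_PARAMS = 3_000_000_000

size = weights_size_gib(NOMINAL_PARAMS, bits_per_weight=16)
print(f"~{size:.2f} GiB of weights at 16 bits per weight")
```

Under that assumption the weights alone come to a little under 6 GiB, so a quantized variant may be preferable on machines with less memory.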

Intended Use

This model generates Bengali speech from text. It is suited to experimenting with Bengali TTS, for example in audiobook narration, conversational AI, or other speech synthesis prototypes.

Model Training

Dataset: SUST-CSE-Speech/banspeech (955 audiobook audio samples)
Training Epochs: 10
Hardware: Google Colab (single T4 GPU)
Training Script: A modified Unsloth fine-tuning script, available on GitHub: Orpheus TTS Training Script.
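To give a sense of how short this training run is, the sketch below estimates the number of optimizer steps from the dataset size and epoch count listed above. The batch size and gradient accumulation values are hypothetical, since this card does not state them, and the formula is a simple approximation rather than the exact step count a given trainer would report.

```python
import math


def estimate_optimizer_steps(num_samples: int, epochs: int,
                             per_device_batch: int, grad_accum: int) -> int:
    """Rough optimizer step count for a single-GPU run."""
    effective_batch = per_device_batch * grad_accum
    steps_per_epoch = math.ceil(num_samples / effective_batch)
    return steps_per_epoch * epochs


# From this card: 955 samples, 10 epochs.
# Hypothetical (not stated in the card): batch size 2, gradient accumulation 4.
steps = estimate_optimizer_steps(num_samples=955, epochs=10,
                                 per_device_batch=2, grad_accum=4)
print(f"approximately {steps} optimizer steps")
```

With these assumed settings the run amounts to roughly 1,200 optimizer steps, which is consistent with the limitations described below.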

Limitations

This model was trained on a small dataset for a limited number of epochs, so the synthesized speech may sound less natural or be less accurate than that of a fully trained model.
Because this is a proof-of-concept model, synthesis quality may vary with the input text. It is not optimized for production environments.

Repository: asif00/orpheus-bangla-tts-gguf