A newer version of the Gradio SDK is available:
5.35.0
metadata
title: RAWI Voice to Story Generator
emoji: π
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: false
python_version: 3.9
RAWI Voice to Story Generator
This Hugging Face Space converts Arabic voice recordings into polished stories using Whisper for speech recognition and DeepSeek API for creative text generation.
How It Works
- Upload or record an Arabic audio clip
- The system transcribes the speech using OpenAI's Whisper model
- The transcript is sent to DeepSeek API to generate an enhanced story
- Both the original transcript and the generated story are displayed
Setup
This Space requires a DeepSeek API key to work properly. When deploying:
- Go to the Settings tab of your Space
- Add your DeepSeek API key as a secret named
DEEPSEEK_API_KEY
- (Optional) If needed, change the API endpoint by adding
DEEPSEEK_API_URL
Deploying to Hugging Face Spaces
To deploy this application to Hugging Face Spaces:
- Create a new Space on Hugging Face
- Select "Gradio" as the SDK
- Upload the contents of this directory to your Space
- Set the required secrets in the Space settings
- Choose a suitable hardware tier (recommend at least CPU-M)
Local Development
To run this project locally:
- Clone this repository
- Install dependencies:
pip install -r requirements.txt
- Set environment variables:
export DEEPSEEK_API_KEY=your_deepseek_api_key
- Run the application:
python app.py
Technologies Used
- Whisper: AI-powered speech recognition model
- Gradio: Web interface for ML applications
- DeepSeek API: Arabic text generation and enhancement
Note
This application is designed for Arabic language content. Using other languages may result in suboptimal performance.