metadata

title: RAWI Voice to Story Generator
emoji: 📝
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: false
python_version: 3.9

RAWI Voice to Story Generator

This Hugging Face Space converts Arabic voice recordings into polished stories using Whisper for speech recognition and DeepSeek API for creative text generation.

How It Works

Upload or record an Arabic audio clip
The system transcribes the speech using OpenAI's Whisper model
The transcript is sent to DeepSeek API to generate an enhanced story
Both the original transcript and the generated story are displayed

Setup

This Space requires a DeepSeek API key to work properly. When deploying:

Go to the Settings tab of your Space
Add your DeepSeek API key as a secret named DEEPSEEK_API_KEY
(Optional) If needed, change the API endpoint by adding DEEPSEEK_API_URL

Deploying to Hugging Face Spaces

To deploy this application to Hugging Face Spaces:

Create a new Space on Hugging Face
Select "Gradio" as the SDK
Upload the contents of this directory to your Space
Set the required secrets in the Space settings
Choose a suitable hardware tier (recommend at least CPU-M)

Local Development

To run this project locally:

Clone this repository
Install dependencies: pip install -r requirements.txt

Set environment variables:

export DEEPSEEK_API_KEY=your_deepseek_api_key

Run the application: python app.py

Technologies Used

Whisper: AI-powered speech recognition model
Gradio: Web interface for ML applications
DeepSeek API: Arabic text generation and enhancement

Note

This application is designed for Arabic language content. Using other languages may result in suboptimal performance.