aliceer committed on Jan 14

Commit

049d5e8

0 Parent(s):

initial commit

Files changed (3)

README.md ADDED

+# Thai Speech Recognition App
+An application that converts Thai speech to text using a Whisper model fine-tuned for Thai
+## Usage
+1. Upload an audio file containing Thai speech (several audio formats are supported, e.g. .mp3, .wav)
+2. Wait a moment while the system processes the file and displays the transcribed text
+3. Upload a new file to transcribe another recording
+## Technologies used
+- Hugging Face Transformers
+- Whisper Model (biodatlab/whisper-th-medium-combined)
+- Gradio
+- PyTorch

app.py ADDED

+import torch
+import gradio as gr
+from transformers import pipeline
+MODEL_NAME = "biodatlab/whisper-th-medium-combined"
+device = 0 if torch.cuda.is_available() else "cpu"
+pipe = pipeline(
+    task="automatic-speech-recognition",
+    model=MODEL_NAME,
+    chunk_length_s=30,
+    device=device,
+)
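+def transcribe_audio(audio_file):
+    """Transcribe a Thai audio file (a path on disk) and return the text."""
+    # Gradio passes None when the form is submitted without a file
+    if audio_file is None:
+        raise gr.Error("Please upload an audio file first.")
+    try:
+        result = pipe(audio_file, generate_kwargs={"language": "<|th|>", "task": "transcribe"}, batch_size=16)
+        return result["text"]
+    except Exception as e:
+        # Show the failure as a UI error instead of returning it as a transcript
+        raise gr.Error(str(e)) from e
+# Build the Gradio interface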
+demo = gr.Interface(
+    fn=transcribe_audio,
+    inputs=gr.Audio(type="filepath"),
+    outputs="text",
+    title="Thai Speech Recognition",
+    description="Upload a Thai audio file to convert it to text",
+)
+if __name__ == "__main__":
+    demo.launch()
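
The `chunk_length_s=30` setting in `app.py` lets the pipeline handle recordings longer than Whisper's 30-second input window by cutting the audio into overlapping chunks. The sketch below is only an approximation of that idea in plain Python, not the library's code: `chunk_windows` is a hypothetical helper, and the overlap of one sixth of the chunk length on each side is an assumption about the pipeline's default stride.

```python
def chunk_windows(duration_s, chunk_length_s=30.0, stride_s=None):
    """Return (start, end) windows in seconds that cover a clip of duration_s.

    Hypothetical helper for illustration; the overlap default below is an
    assumption about the pipeline's behaviour, not a value from the commit.
    """
    duration_s = float(duration_s)
    if stride_s is None:
        stride_s = chunk_length_s / 6  # assumed overlap on each side of a chunk
    # Consecutive windows advance by the chunk length minus both overlaps
    step = chunk_length_s - 2 * stride_s
    if step <= 0:
        raise ValueError("stride_s must be smaller than half of chunk_length_s")

    windows = []
    start = 0.0
    while True:
        end = min(start + chunk_length_s, duration_s)
        windows.append((start, end))
        if start + chunk_length_s >= duration_s:
            break  # this window already reaches the end of the clip
        start += step
    return windows


if __name__ == "__main__":
    # A 70-second clip is covered by three overlapping 30-second windows
    print(chunk_windows(70))
```

Under these assumptions, neighbouring windows share 10 seconds of audio, which gives the model context on both sides of each cut when the per-chunk transcripts are merged.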

requirements.txt ADDED

+transformers
+torch
+librosa
+gradio