mistralai/Mistral-Small-3.2-24B-Instruct-2506 Image-Text-to-Text β’ 24B β’ Updated 7 days ago β’ 116k β’ 347
OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT Text Generation β’ 33B β’ Updated 24 days ago β’ 220 β’ 12
view post Post 2243 π Whisper-OCR Multilingual Translation Space πWelcome! This Space takes English audio, video, images, and PDFs and instantly converts them into Chinese (ZH), Thai (TH), and Russian (RU)βno other source language required. VIDraft/voice-transβ¨ Key Featuresπ€ Microphoneββ Record English speech β transcript + 3-language translationπ Audio Fileββ Upload English audio β transcript + translationπ¬ Video Fileββ Auto-extract audio with FFmpeg β transcript + translationπΌοΈ Imageββ Nanonets-OCR pulls text β translationπ PDFββ Up to 50 pages of text & tables β translationπ Realtime Modeββ Flush every 10-15 s; newest lines appear at the topπ οΈ Quick StartClick βDuplicateβ to fork, or launch directly.Pick a tab (π€/π/π¬/πΌοΈ/π/π) and feed it English input.After a few seconds, see the π original and π 3-language translation side by side.β‘ Tech Stackopenai/whisper-large-v3-turbo β fast, high-accuracy ASRNanonets-OCR-s (+ Flash Attention 2) β document/image OCRGradio Blocks β clean tabbed UIPyTorch + CUDA β auto GPU allocation & ThreadPool load balancingπ NotesTranslation quality depends on audio quality, lighting, and resolution.Huge videos hit the HF Space upload cap (~2 GB).Realtime tab requires browser microphone permission. See translation π₯ 11 11 π 5 5 π 3 3 π 2 2 + Reply
OptimusePrime/Magistral-Small-2506-Vision Image-Text-to-Text β’ 24B β’ Updated 28 days ago β’ 195 β’ 8
OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview1-QAT Text Generation β’ 33B β’ Updated Jun 11 β’ 64 β’ 3