Extract audio from video and transcribe it
Extract text from a PDF file
Convert PDFs to text using OCR