Extract and recognize text from documents
Convert PDF to text using OCR
Calculate memory usage for training models