gradio numpy pandas torch transformers sentence-transformers spacy en-core-web-lg @ https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.7.0/en_core_web_lg-3.7.0-py3-none-any.whl optuna unstructured PyMuPDF pillow pytesseract scikit-learn geopy numba python-docx beautifulsoup4 tqdm protobuf regex nltk python-magic markdown-it-py fastapi uvicorn pytest flake8 tensorboardX huggingface-hub tokenizers setuptools wheel psutil aiohttp pdf2image layoutparser opencv-python pdfplumber pi-heif decorator unstructured-inference unstructured.pytesseract sentencepiece python-magic pdfminer.six antiword