wanderkid's picture
Add new table recognition model: StructEqTable
e34ece3
|
raw
history blame
1.78 kB

Install Git LFS

Before you begin, make sure Git Large File Storage (Git LFS) is installed on your system. Install it using the following command:

git lfs install

Download the Model from Hugging Face

To download the PDF-Extract-Kit model from Hugging Face, use the following command:

git lfs clone https://huggingface.co/wanderkid/PDF-Extract-Kit

Ensure that Git LFS is enabled during the clone to properly download all large files.

Download the Model from ModelScope

SDK Download

# First, install the ModelScope library using pip:
pip install modelscope
# Use the following Python code to download the model using the ModelScope SDK:
from modelscope import snapshot_download
model_dir = snapshot_download('wanderkid/PDF-Extract-Kit')

Git Download

Alternatively, you can use Git to clone the model repository from ModelScope:

git clone https://www.modelscope.cn/wanderkid/PDF-Extract-Kit.git

Put model files here:

./
β”œβ”€β”€ Layout
β”‚   β”œβ”€β”€ config.json
β”‚   └── model_final.pth
β”œβ”€β”€ MFD
β”‚   └── weights.pt
β”œβ”€β”€ MFR
β”‚   └── UniMERNet
β”‚       β”œβ”€β”€ config.json
β”‚       β”œβ”€β”€ preprocessor_config.json
β”‚       β”œβ”€β”€ pytorch_model.bin
β”‚       β”œβ”€β”€ README.md
β”‚       β”œβ”€β”€ tokenizer_config.json
β”‚       └── tokenizer.json
β”œβ”€β”€ TabRec
β”‚   └── StructEqTable
β”‚       β”œβ”€β”€ config.json
β”‚       β”œβ”€β”€generation_config.json
β”‚       β”œβ”€β”€model.safetensors
β”‚       β”œβ”€β”€preprocessor_config.json
β”‚       β”œβ”€β”€special_tokens_map.json
β”‚       β”œβ”€β”€spiece.model
β”‚       β”œβ”€β”€tokenizer_config.json
β”‚       └──tokenizer.json
└── README.md