k4tel
/

vit-historical-page

Image Classification

Model card Files Files and versions

k4tel commited on Mar 3

Commit

922f057

·

verified ·

1 Parent(s): cfb9873

Update README.md

Files changed (1) hide show

README.md +11 -8

README.md CHANGED Viewed

@@ -3,11 +3,14 @@ library_name: transformers
 tags:
 - page
 - classification
 ---
 # Image processing using ViT - for historical documents
-### Goal: This project solves a task of page images classification
 **Scope:** Processing of images, training and evaluation of ViT model,
 input file/directory processing, class (category) results of top
@@ -16,8 +19,6 @@ HF 😊 hub support for the model
 ## Model description
-Fine-tuned model files can be found here:  [huggingface.co/k4tel/vit-historical-page](https://huggingface.co/k4tel/vit-historical-page) 🔗
 - **Developed by** UFAL
 - **Funded by** ATRIUM
 - **Shared by** ATRIUM & UFAL
@@ -74,17 +75,19 @@ Evaluation set (same proportions):	**995** images
 Evaluation set's accuracy (**Top-3**):  **99.6%**
-![TOP-3 confusion matrix - trained ViT](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/plots/20250209-1526_conf_mat.png?raw=true)
 Evaluation set's accuracy (**Top-1**):  **97.3%**
-![TOP-1 confusion matrix - trained ViT](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/plots/20250218-1523_conf_mat.png?raw=true)
 #### Result tables
-- Manually ✍ **checked** evaluation dataset results (TOP-3): [model_TOP-3_EVAL.csv](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/tables/20250209-1534_model_1119_3_TOP-3_EVAL.csv) 🔗
-- Manually ✍ **checked** evaluation dataset results (TOP-1): [model_TOP-1_EVAL.csv](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/tables/20250218-1519_model_1119_3_TOP-1_EVAL.csv) 🔗
 #### Table columns
@@ -96,4 +99,4 @@ Evaluation set's accuracy (**Top-1**):  **97.3%**
 #### Contacts
-For support write to 📧 [email protected] 📧

 tags:
 - page
 - classification
+base_model:
+- google/vit-base-patch16-224
+pipeline_tag: image-classification
 ---
 # Image processing using ViT - for historical documents
+### Goal: This project solves a task of page images classification (for their further content-based processing)
 **Scope:** Processing of images, training and evaluation of ViT model,
 input file/directory processing, class (category) results of top
 ## Model description
 - **Developed by** UFAL
 - **Funded by** ATRIUM
 - **Shared by** ATRIUM & UFAL
 Evaluation set's accuracy (**Top-3**):  **99.6%**
+https://github.com/K4TEL/atrium-ufal
+![TOP-3 confusion matrix - trained ViT](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/plots/20250209-1526_conf_mat.png?raw=true)
 Evaluation set's accuracy (**Top-1**):  **97.3%**
+![TOP-1 confusion matrix - trained ViT](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/plots/20250218-1523_conf_mat.png?raw=true)
 #### Result tables
+- Manually ✍ **checked** evaluation dataset results (TOP-3): [model_TOP-3_EVAL.csv](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/tables/20250209-1534_model_1119_3_TOP-3_EVAL.csv) 🔗
+- Manually ✍ **checked** evaluation dataset results (TOP-1): [model_TOP-1_EVAL.csv](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/tables/20250218-1519_model_1119_3_TOP-1_EVAL.csv) 🔗
 #### Table columns
 #### Contacts
+For support write to 📧 [email protected] 📧