k4tel commited on
Commit
922f057
Β·
verified Β·
1 Parent(s): cfb9873

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -8
README.md CHANGED
@@ -3,11 +3,14 @@ library_name: transformers
3
  tags:
4
  - page
5
  - classification
 
 
 
6
  ---
7
 
8
  # Image processing using ViT - for historical documents
9
 
10
- ### Goal: This project solves a task of page images classification
11
 
12
  **Scope:** Processing of images, training and evaluation of ViT model,
13
  input file/directory processing, class (category) results of top
@@ -16,8 +19,6 @@ HF 😊 hub support for the model
16
 
17
  ## Model description
18
 
19
- Fine-tuned model files can be found here: [huggingface.co/k4tel/vit-historical-page](https://huggingface.co/k4tel/vit-historical-page) πŸ”—
20
-
21
  - **Developed by** UFAL
22
  - **Funded by** ATRIUM
23
  - **Shared by** ATRIUM & UFAL
@@ -74,17 +75,19 @@ Evaluation set (same proportions): **995** images
74
 
75
  Evaluation set's accuracy (**Top-3**): **99.6%**
76
 
77
- ![TOP-3 confusion matrix - trained ViT](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/plots/20250209-1526_conf_mat.png?raw=true)
 
 
78
 
79
  Evaluation set's accuracy (**Top-1**): **97.3%**
80
 
81
- ![TOP-1 confusion matrix - trained ViT](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/plots/20250218-1523_conf_mat.png?raw=true)
82
 
83
  #### Result tables
84
 
85
- - Manually ✍ **checked** evaluation dataset results (TOP-3): [model_TOP-3_EVAL.csv](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/tables/20250209-1534_model_1119_3_TOP-3_EVAL.csv) πŸ”—
86
 
87
- - Manually ✍ **checked** evaluation dataset results (TOP-1): [model_TOP-1_EVAL.csv](https://github.com/K4TEL/ltp-ocr/blob/transformer/result/tables/20250218-1519_model_1119_3_TOP-1_EVAL.csv) πŸ”—
88
 
89
  #### Table columns
90
 
@@ -96,4 +99,4 @@ Evaluation set's accuracy (**Top-1**): **97.3%**
96
 
97
  #### Contacts
98
 
99
- For support write to πŸ“§ [email protected] πŸ“§
 
3
  tags:
4
  - page
5
  - classification
6
+ base_model:
7
+ - google/vit-base-patch16-224
8
+ pipeline_tag: image-classification
9
  ---
10
 
11
  # Image processing using ViT - for historical documents
12
 
13
+ ### Goal: This project solves a task of page images classification (for their further content-based processing)
14
 
15
  **Scope:** Processing of images, training and evaluation of ViT model,
16
  input file/directory processing, class (category) results of top
 
19
 
20
  ## Model description
21
 
 
 
22
  - **Developed by** UFAL
23
  - **Funded by** ATRIUM
24
  - **Shared by** ATRIUM & UFAL
 
75
 
76
  Evaluation set's accuracy (**Top-3**): **99.6%**
77
 
78
+ https://github.com/K4TEL/atrium-ufal
79
+
80
+ ![TOP-3 confusion matrix - trained ViT](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/plots/20250209-1526_conf_mat.png?raw=true)
81
 
82
  Evaluation set's accuracy (**Top-1**): **97.3%**
83
 
84
+ ![TOP-1 confusion matrix - trained ViT](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/plots/20250218-1523_conf_mat.png?raw=true)
85
 
86
  #### Result tables
87
 
88
+ - Manually ✍ **checked** evaluation dataset results (TOP-3): [model_TOP-3_EVAL.csv](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/tables/20250209-1534_model_1119_3_TOP-3_EVAL.csv) πŸ”—
89
 
90
+ - Manually ✍ **checked** evaluation dataset results (TOP-1): [model_TOP-1_EVAL.csv](https://github.com/K4TEL/atrium-ufal/blob/transformer/result/tables/20250218-1519_model_1119_3_TOP-1_EVAL.csv) πŸ”—
91
 
92
  #### Table columns
93
 
 
99
 
100
  #### Contacts
101
 
102
+ For support write to πŸ“§ [email protected] πŸ“§