metadata
license: mit
library_name: transformers
tags:
- donut
- classification
- irs
- tax
- document AI
Donut model fine-tuned for US IRS tax document classification
This Donut model has been fine-tuned for US IRS tax document classification. It can classify up to 28 different types of IRS documents, targeting a common set of documents used for tax returns. The supported document types include:
- 1040 U.S. Individual Income Tax Return
- 1040-NR U.S. Nonresident Alien Income Tax Return
- 1040-NR SCHEDULE OI Other Information
- 1040 SCHEDULE 1 Additional Income and Adjustments to Income
- 1040 SCHEDULE 2 Additional Taxes
- 1040 SCHEDULE 3 Additional Credits and Payments
- 1040 SCHEDULE 8812 Credits for Qualifying Children and Other Dependents
- 1040 SCHEDULE A Itemized Deductions
- 1040 SCHEDULE B Interest and Ordinary Dividends
- 1040 SCHEDULE C Profit or Loss From Business
- 1040 SCHEDULE D Capital Gains and Losses
- 1040 SCHEDULE E Supplemental Income and Loss
- 1040 SCHEDULE SE Self-Employment Tax
- Form 1125-A Cost of Goods Sold
- Form 8949 Sales and Other Dispositions of Capital Assets
- Form 8959 Additional Medicare Tax
- Form 8960 Net Investment Income Tax — Individuals, Estates, and Trusts
- Form 8995 Qualified Business Income Deduction Simplified Computation
- Form 8995-A SCHEDULE A Specified Service Trades or Businesses
- Form W-2 Wage and Tax Statement
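As a rough illustration of how a page image might be classified with this model, the sketch below follows the usual Donut document-classification pattern from the transformers library. Several things here are assumptions rather than facts from this card: `MODEL_ID` is a hypothetical placeholder, the `<s_rvlcdip>` task prompt is assumed to be inherited unchanged from the RVL-CDIP Donut checkpoint, and the output is assumed to wrap the label as `<s_class><label/></s_class>`. The helper `extract_class` and the function `classify` are illustrative names, not part of any library.

```python
import re
from typing import Optional

# Hypothetical placeholder: replace with the real Hub id or local path of this checkpoint.
MODEL_ID = "your-namespace/donut-irs-tax-classifier"
# Assumption: the fine-tuned model keeps the task prompt of the RVL-CDIP Donut checkpoint.
TASK_PROMPT = "<s_rvlcdip>"


def extract_class(sequence: str) -> Optional[str]:
    """Pull the label out of a decoded sequence shaped like
    '<s_rvlcdip><s_class><label/></s_class></s_rvlcdip>' (assumed format)."""
    match = re.search(r"<s_class>\s*<(.+?)/>\s*</s_class>", sequence)
    return match.group(1).strip() if match else None


def classify(image_path: str) -> Optional[str]:
    """Run one page image through the model and return the predicted label."""
    # Heavy dependencies are imported lazily so extract_class works without them.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(MODEL_ID)
    model = VisionEncoderDecoderModel.from_pretrained(MODEL_ID)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        TASK_PROMPT, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        outputs = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            use_cache=True,
            bad_words_ids=[[processor.tokenizer.unk_token_id]],
            return_dict_in_generate=True,
        )

    sequence = processor.batch_decode(outputs.sequences)[0]
    return extract_class(sequence)
```

Calling `classify("page.png")` on a scanned form would then return the label string emitted by the decoder, or `None` if the output does not match the assumed format. The exact label spelling depends on how the checkpoint was trained and should be checked against its configuration.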
Model Details & Description
The base model is 'naver-clova-ix/donut-base-finetuned-rvlcdip'. It was fine-tuned on a training data set of more than 3,000 documents. The label2id mapping in config.json has been updated to reflect all labels that the model can classify.
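To see which labels a downloaded copy of the checkpoint declares, the label2id mapping can be read straight from config.json. The following is a minimal sketch using only the standard library; it assumes config.json is available locally and that label2id maps label strings to integer ids. The function names `invert_label2id` and `load_id2label` are hypothetical helpers introduced for illustration.

```python
import json
from pathlib import Path
from typing import Dict


def invert_label2id(label2id: Dict[str, int]) -> Dict[int, str]:
    """Turn a label -> id mapping into id -> label, rejecting duplicate ids."""
    id2label = {int(idx): label for label, idx in label2id.items()}
    if len(id2label) != len(label2id):
        raise ValueError("label2id maps several labels to the same id")
    return id2label


def load_id2label(config_path: str) -> Dict[int, str]:
    """Read label2id from a local config.json and return its inverse."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return invert_label2id(config["label2id"])
```

For instance, `sorted(load_id2label("config.json").items())` would list every id with its label, which is a quick way to confirm how many document types a given copy of the model actually covers.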