Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

naver-clova-ix
/
donut-base

Image-to-Text
Transformers
PyTorch
vision-encoder-decoder
image-text-to-text
donut
vision
Model card Files Files and versions Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Fine tuning using LoRa

#14 opened 4 days ago by
DenisMir

Error Fine Tuning due to unexpected keyword argument

1
#13 opened 3 months ago by
reganshen

How to extract all the text from the document?

1
#12 opened 10 months ago by
Maz369

add _name_or_path

#11 opened 12 months ago by
nbroad

Why torch.compile has very small acceleration for Donut model?

πŸ‘€ 1
1
#10 opened about 1 year ago by
gorodnitskiy

Will this modle suitable for invoce processing ?

2
#9 opened over 1 year ago by
total008

Discrepancies between DONUT / BART Tokenizer and missing characters

πŸ‘ 9
1
#8 opened over 1 year ago by
DieseKartoffel

Adding `safetensors` variant of this model

#7 opened over 1 year ago by
SFconvertbot

Architecture of donut

#6 opened almost 2 years ago by
shubham05

Change image_mean and image_std to ImageNet to match original codebase

#5 opened over 2 years ago by
morgan

Failed to download Donut Processor

πŸ‘ 1
5
#3 opened over 2 years ago by
paturi1710

Minimum GPU requirement to fine-tune donut model

#2 opened almost 3 years ago by
LeonardoVaz
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs