Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
microsoft 's Collections
NatureLM
NextCoder
Phi-4
Phi-3
Phi-1
Controllable Safety Alignment
BitNet
MAI-DS-R1
LLM2CLIP
SpeechT5
TAPEX
Table Transformer
LayoutLM
Biomedical
Orca
UDOP
GIT
Florence
IFMs
MoCapAct

LayoutLM

updated May 1

The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA.

Upvote
19

  • microsoft/layoutlmv3-base

    0.1B • Updated Apr 10, 2024 • 1.54M • 417

    Note Currently the best LayoutLM model.


  • microsoft/layoutlmv2-base-uncased

    Updated Sep 16, 2022 • 146k • 67

  • microsoft/layoutlm-base-uncased

    0.1B • Updated Apr 16, 2024 • 2.98M • 57

  • microsoft/layoutxlm-base

    Updated Sep 16, 2022 • 33.7k • 72

    Note A multilingual variant trained on 100 languages.


  • impira/layoutlm-document-qa

    Document Question Answering • 0.1B • Updated Mar 18, 2023 • 23.9k • 1.12k

    Note A LayoutLM (v1) model fine-tuned to perform question answering over documents (DocVQA).


  • nielsr/layoutlmv3-finetuned-funsd

    Token Classification • 0.1B • Updated Sep 16, 2023 • 2.55k • • 26

    Note A LayoutLMv3 model fine-tuned on the FUNSD dataset, a benchmark for document parsing.

Upvote
19
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs