microsoft/layoutlmv2-base-uncased

Yiheng Xu committed on Aug 11, 2021

Commit 1ec2670 (1 parent: aae5dd2)

Update README.md

Files changed (1)

README.md +1 -1

@@ -7,7 +7,7 @@ license: cc-by-sa-4.0
 # LayoutLMv2
 **Multimodal (text + layout/format + image) pre-training for document AI**
-[Microsoft Document AI](https://www.microsoft.com/en-us/research/project/document-ai/) | [Github Repository](https://github.com/microsoft/unilm/tree/master/layoutlmv2)
+[Microsoft Document AI](https://www.microsoft.com/en-us/research/project/document-ai/) | [GitHub](https://github.com/microsoft/unilm/tree/master/layoutlmv2)
 ## Introduction
 LayoutLMv2 is an improved version of LayoutLM with new pre-training tasks to model the interaction among text, layout, and image in a single multi-modal framework. It outperforms strong baselines and achieves new state-of-the-art results on a wide variety of downstream visually-rich document understanding tasks, including FUNSD (0.7895 → 0.8420), CORD (0.9493 → 0.9601), SROIE (0.9524 → 0.9781), Kleister-NDA (0.834 → 0.852), RVL-CDIP (0.9443 → 0.9564), and DocVQA (0.7295 → 0.8672).