Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2204.08387

Papers - Document AI

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR - Tesseract for Text Location

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - Table Structure Recognition

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Image - OCR

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
Text Role Classification in Scientific Charts Using Multimodal Transformers

Paper • 2402.14579 • Published Feb 8 • 1
An inclusive review on deep learning techniques and their scope in handwriting recognition

Paper • 2404.08011 • Published Apr 10 • 1

Papers - Image - Tabular

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2

Papers - Documents - Tabular

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
More efficient manual review of automatically transcribed tabular data

Paper • 2306.16126 • Published Jun 28, 2023 • 1
CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

Paper • 2004.12629 • Published Apr 27, 2020 • 2

Papers - Document - OCR

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30 • 7
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Paper • 2203.08411 • Published Mar 16, 2022 • 1
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Paper • 2305.02549 • Published May 4, 2023 • 6
ETC: Encoding Long and Structured Inputs in Transformers

Paper • 2004.08483 • Published Apr 17, 2020 • 1

Papers - Documents - LayoutLM

Noise-Aware Training of Layout-Aware Language Models

Paper • 2404.00488 • Published Mar 30 • 7
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

Paper • 2204.08387 • Published Apr 18, 2022 • 2
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Paper • 2012.14740 • Published Dec 29, 2020 • 1
LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Paper • 1912.13318 • Published Dec 31, 2019 • 2

Papers - Microsoft

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 32
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28 • 18
WavLLM: Towards Robust and Adaptive Speech Large Language Model

Paper • 2404.00656 • Published Mar 31 • 10
Enabling Memory Safety of C Programs using LLMs

Paper • 2404.01096 • Published Apr 1 • 1

Awesome Document AI

A collection of open-source document AI 📄 📝 📈

Running on Zero

82

🏃

UDOP
Running on Zero

37

📚

Pix2struct

Play with all the pix2struct variants in this d
Running

24

🦀

Compare Docvqa Models

Compare different visual question answering
Runtime error

289

🦉

DocQuery — Document Query Engine

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs