dfahey's Collections
ocr

updated 12 days ago

  • mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

    Paper • 2403.12895 • Published Mar 19, 2024 • 32

  • microsoft/layoutlm-base-uncased

    0.1B • Updated Apr 16, 2024 • 141k • 60

  • microsoft/layoutlmv3-base

    0.1B • Updated Apr 10, 2024 • 627k • 458

  • naver-clova-ix/donut-base-finetuned-docvqa

    Document Question Answering • Updated Mar 9, 2024 • 198k • 255

  • microsoft/udop-large

    Image-Text-to-Text • 0.7B • Updated Mar 11, 2024 • 15.7k • 120

  • impira/layoutlm-document-qa

    Document Question Answering • 0.1B • Updated Mar 18, 2023 • 33.5k • 1.15k

  • InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

    Paper • 2401.13313 • Published Jan 24, 2024 • 5

  • DocLLM: A layout-aware generative language model for multimodal document understanding

    Paper • 2401.00908 • Published Dec 31, 2023 • 189

  • mPLUG/DocOwl1.5

    8B • Updated Apr 10, 2024 • 19 • 26

  • olmOCR 2: Unit Test Rewards for Document OCR

    Paper • 2510.19817 • Published 29 days ago • 13