Here are 10 public repositories matching this topic...
OCR engine for all the languages
Updated Nov 28, 2025, Python

Probabilistic Key Value pair extraction using word weights from Invoices - Non Searchable PDF
Updated Jun 12, 2021, Python

✏️ Integration of Tesseract for Python using a shared library
Updated Mar 25, 2016, Python

Python parser for hOCR files using lxml
Updated Aug 23, 2020, Python

Graphical hOCR editor to produce minimal diffs for proofreading of Tesseract OCR output
Updated Oct 25, 2025, Python

(no description)
Updated Dec 8, 2019, Python

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Updated Nov 24, 2025, Python

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Updated Oct 3, 2023, Python

OCR engine for all the languages
Updated Jan 6, 2023, Python

TIFF Image - Converted into OCR XML using Tesseract
Updated Mar 9, 2024, Python