pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
- Updated
Mar 19, 2026 - C#
pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
A lightweight, Open Source Python library for transliterating and normalizing Unicode text to Latin ASCII using configurable mappings and Unicode normalization forms, written in Python.
A lightweight, Open Source transliterate.js library that converts accented, special, and non-Latin characters into plain, readable text - written in JavaScript.
Add a description, image, and links to the diacritic topic page so that developers can more easily learn about it.
To associate your repository with the diacritic topic, visit your repo's landing page and select "manage topics."