Questions tagged [ocr]

Ask Question

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text.

76 questions

2 votes

1 answer

49 views

How do I get Document AI to recognise a special type of checkbox?

I am using Google Cloud Document AI Custom Extractor to get data from an exam paper. For some reason, it doesn't understand the checkbox, returning an f1 score of 0 on both a pretrained model and a ...

mranonymous7122

asked Aug 30 at 12:43

2 votes

0 answers

25 views

Preprocess function using OpenCV not accurate enough for latin characters when using Tesseract 5

This is the full repo. https://github.com/MortalWombat-repo/ebrojevi_ocr_api this is the app.py preprocess function from the repo where opencv is used https://github.com/MortalWombat-repo/...

Ico

asked May 26 at 11:39

0 votes

1 answer

62 views

My keras CNN model gives good predictions in 10/11 classes but missleads to the 11th class. What can I do to improve?

My project involves classifying printed numerical characters from real-life essays. My dataset includes 11 classes ('0' - '10'), with the label '10' representing the '/' symbol. The issue is that ...

Mai Khanh

asked Jul 3, 2024 at 4:38

1 vote

1 answer

590 views

text extraction from bank statements from pdf format

I have bank statement memos containing transaction tables I need to extract. I only need to extract the transactions list. I have tried to use the Amazon text extractor, LayoutLM but since every bank ...

Abduhoshim

asked Jan 24, 2024 at 11:21

1 vote

0 answers

125 views

Highlight specific paragraphs from documents

I have a bunch of documents in which I want to highlight certain paragraphs/keyphrases. I have a list of the most frequently appearing sentences and I want to search for these paragraphs/keyphrases in ...

spectre

2,288

asked Jun 27, 2023 at 6:40

0 votes

0 answers

352 views

CNN Model Outputting All Zero's

I'm pretty new to machine learning and am having trouble with creating my first TensorFlow convolutional neural network. I'm using datasets from http://etlcdb.db.aist.go.jp/ and trying to get my ...

al3xbro

asked Jun 1, 2023 at 8:48

1 vote

1 answer

59 views

How are OCR training datasets constructed?

For the sake of concreteness: let's suppose that the word "OCR" refers to any OCR system build on an R-CNN architecture. Similarly, in aims of simplicity, let's declare that we are ...

Ramiro Hum-Sah

asked Oct 26, 2022 at 14:57

0 votes

1 answer

330 views

OCR with grouped text based on solid rectangles

I can read text from an image using OCR. However, it works line by line. I want to now group text based on solid lines surrounding the text. For example, consider I have below rectangle banners. I can ...

harpan

asked Jun 9, 2022 at 22:46

15 30 50 per page

2 3 4 5 6 Next

Stack Exchange Network

Questions tagged [ocr]

How do I get Document AI to recognise a special type of checkbox?

Preprocess function using OpenCV not accurate enough for latin characters when using Tesseract 5

My keras CNN model gives good predictions in 10/11 classes but missleads to the 11th class. What can I do to improve?

text extraction from bank statements from pdf format

Highlight specific paragraphs from documents

CNN Model Outputting All Zero's

How are OCR training datasets constructed?

OCR with grouped text based on solid rectangles

Hot Network Questions