Questions tagged [ocr]
Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text.
76 questions
2 votes
1 answer
49 views
How do I get Document AI to recognise a special type of checkbox?
I am using Google Cloud Document AI Custom Extractor to get data from an exam paper. For some reason, it doesn't understand the checkbox, returning an f1 score of 0 on both a pretrained model and a ...
2 votes
0 answers
25 views
Preprocess function using OpenCV not accurate enough for latin characters when using Tesseract 5
This is the full repo. https://github.com/MortalWombat-repo/ebrojevi_ocr_api this is the app.py preprocess function from the repo where opencv is used https://github.com/MortalWombat-repo/...
0 votes
1 answer
62 views
My keras CNN model gives good predictions in 10/11 classes but missleads to the 11th class. What can I do to improve?
My project involves classifying printed numerical characters from real-life essays. My dataset includes 11 classes ('0' - '10'), with the label '10' representing the '/' symbol. The issue is that ...
1 vote
1 answer
590 views
text extraction from bank statements from pdf format
I have bank statement memos containing transaction tables I need to extract. I only need to extract the transactions list. I have tried to use the Amazon text extractor, LayoutLM but since every bank ...
1 vote
0 answers
125 views
Highlight specific paragraphs from documents
I have a bunch of documents in which I want to highlight certain paragraphs/keyphrases. I have a list of the most frequently appearing sentences and I want to search for these paragraphs/keyphrases in ...
0 votes
0 answers
352 views
CNN Model Outputting All Zero's
I'm pretty new to machine learning and am having trouble with creating my first TensorFlow convolutional neural network. I'm using datasets from http://etlcdb.db.aist.go.jp/ and trying to get my ...
1 vote
1 answer
59 views
How are OCR training datasets constructed?
For the sake of concreteness: let's suppose that the word "OCR" refers to any OCR system build on an R-CNN architecture. Similarly, in aims of simplicity, let's declare that we are ...
0 votes
1 answer
330 views
OCR with grouped text based on solid rectangles
I can read text from an image using OCR. However, it works line by line. I want to now group text based on solid lines surrounding the text. For example, consider I have below rectangle banners. I can ...