8

i need an advice on how to approach this problem. I have some picture data: *.jpg, *.bmp ... and i need to extract the data from it. The data is alphanumeric text. I work in delphi.

1
  • You can bake your own NGC algorithm. It recognizes a small image (pattern) into a big image. Then you can apply this algorithm for each character in the alphabet as I described it here. If you afford to do it in the background (not in real time), then it will work. The other solution is to bake your own neural network and train it - a bit more complex but doable. If you have the money, you can buy an existing library but there is none natively written for Delphi so you will end-up with a DLL. Commented Dec 11, 2024 at 15:42

3 Answers 3

13

You will have to head for a OCR (Optical Character Recognition) library. This is a pretty complex procedure, I believe you wouldn't be asking this question if you knew any way to implement this by yourself.

A quick Google yielded this result, maybe it's of help for you: http://www.planet-source-code.com/vb/scripts/ShowCode.asp?txtCodeId=1623&lngWId=7

Sign up to request clarification or add additional context in comments.

6 Comments

yes, you are right, the easier way is to find some tool that can do this work for me, but if anyone can point directions from where to start solving this problem manually by coding i'll be also greatful
@dzibul Are you serious? This is a frightfully hard problem that huge armies of exceedingly clever people have been trying to solve since computers were invented.
@dzibul if you have several man-years of free time and solid background in programming and academic knowledge of math, then you will find plenty of information about writing your own recognizer. Otherwise take an existing solution.
@david & @eugene: Yes, i know that is a big problem. I wondered if i can do that with cutting letters from picture, than comparing pixels from picture and from a sample letter. The right letter will be one that has the most pixels identical. Since the text from picture is machine text (not handwriting), i figured that it won't be so hard (but i appreciate the suggestion that i'm aiming high)
@dzibul - Maybe you'd like to have a look at 'SubRip's source. Developed with Delphi, it's a program that extracts subtitles (converts to text) from video streams. Since your letters are not handwritten a similar approach could help.
|
4

Look here:

https://forums.embarcadero.com/message.jspa?messageID=29331

1 Comment

...or better to say "maybe were"... The link is now dead.
1

Take a look at my answer about NeuroVCL OCR here. There is a lot of useful info and sample Delphi OCR DCU components.

Comments

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.