i need an advice on how to approach this problem. I have some picture data: *.jpg, *.bmp ... and i need to extract the data from it. The data is alphanumeric text. I work in delphi.
- You can bake your own NGC algorithm. It recognizes a small image (pattern) into a big image. Then you can apply this algorithm for each character in the alphabet as I described it here. If you afford to do it in the background (not in real time), then it will work. The other solution is to bake your own neural network and train it - a bit more complex but doable. If you have the money, you can buy an existing library but there is none natively written for Delphi so you will end-up with a DLL.Error - CPU Not Foud– Error - CPU Not Foud2024-12-11 15:42:23 +00:00Commented Dec 11, 2024 at 15:42
Add a comment |
3 Answers
You will have to head for a OCR (Optical Character Recognition) library. This is a pretty complex procedure, I believe you wouldn't be asking this question if you knew any way to implement this by yourself.
A quick Google yielded this result, maybe it's of help for you: http://www.planet-source-code.com/vb/scripts/ShowCode.asp?txtCodeId=1623&lngWId=7
6 Comments
dzibul
yes, you are right, the easier way is to find some tool that can do this work for me, but if anyone can point directions from where to start solving this problem manually by coding i'll be also greatful
David Heffernan
@dzibul Are you serious? This is a frightfully hard problem that huge armies of exceedingly clever people have been trying to solve since computers were invented.
Eugene Mayevski 'Callback
@dzibul if you have several man-years of free time and solid background in programming and academic knowledge of math, then you will find plenty of information about writing your own recognizer. Otherwise take an existing solution.
dzibul
@david & @eugene: Yes, i know that is a big problem. I wondered if i can do that with cutting letters from picture, than comparing pixels from picture and from a sample letter. The right letter will be one that has the most pixels identical. Since the text from picture is machine text (not handwriting), i figured that it won't be so hard (but i appreciate the suggestion that i'm aiming high)
Sertac Akyuz
@dzibul - Maybe you'd like to have a look at 'SubRip's source. Developed with Delphi, it's a program that extracts subtitles (converts to text) from video streams. Since your letters are not handwritten a similar approach could help.
|
1 Comment
TLama
...or better to say "maybe were"... The link is now dead.
Take a look at my answer about NeuroVCL OCR here. There is a lot of useful info and sample Delphi OCR DCU components.