How to get accurate recognitions on letter o and number 0?
Re: How to get accurate recognitions on letter o and number 0?
Hi,
Could you send me this image at esupport (at) gdpicture (dot) com ?
I'll see if I can do something.
Best regards,
Loïc
Could you send me this image at esupport (at) gdpicture (dot) com ?
I'll see if I can do something.
Best regards,
Loïc
Re: How to get accurate recognitions on letter o and number 0?
Hi Guepin,
I alredy sent you an answer by mail. Maybe it it in your anti-spam box ?
Here a copy of my answer:
I alredy sent you an answer by mail. Maybe it it in your anti-spam box ?
Here a copy of my answer:
LoïcHi,
Your text are written using a police which is not OCR friendly.
However, you can get better result converting the image in 1bpp before doing OCR process.
Also, you can call the FxDilate4 method before processing OCR. it can repair some characters.
Best regards,
Loïc
Re: How to get accurate recognitions on letter o and number 0?
Hi,
You have probably loaded the image within an Imaging object to performs OCR.
Before calling the OCR function you can call the ConverTo1Bpp() method.
Also, you can try to repair characters calling the FxDilate4 () (FxBitonalDilate4() for .NET).
IE:
Using ActiveX editions:
Using .NET editions:
Then do OCR here.
With best regards,
Loïc
You have probably loaded the image within an Imaging object to performs OCR.
Before calling the OCR function you can call the ConverTo1Bpp() method.
Also, you can try to repair characters calling the FxDilate4 () (FxBitonalDilate4() for .NET).
IE:
Using ActiveX editions:
Code: Select all
Imaging1.SetNativeImage(GdViewer1.GetNativeImage())
Imaging1.ConvertTo1Bpp()
Imaging1.FxDilate4()
Code: Select all
oGdPictureImaging.ConvertTo1Bpp(m_ImageID)
oGdPictureImaging.FxBitonalDilate4(m_ImageID)
Then do OCR here.
With best regards,
Loïc
Re: How to get accurate recognitions on letter o and number 0?
Hi,
Sorry for the late but I thought this topic was closed. Please, next time start a new topic for a new question.
The implementation is done for the .NET edition and should be released within 2 weeks.
However, we are looking for a way to introduce this feature in the ActiveX editions but today I can't promise a date of release, we encountering too many problem with the PDF engine we are using.
Best regards,
Loïc
Sorry for the late but I thought this topic was closed. Please, next time start a new topic for a new question.
Unfortunately not. OCR is not an exact science. You can try to optimize OCR accuracy applying some filters to your image such as character repair, erosion, dilation... See the "Bitonal Image enhancement function" section of the reference guide of the toolkit to get an exhaustive list of functions.It seems to have fixed the o versus 0 problem. However, it introduced another problem: the letter e was now incorrectly OCRed to c.
Any ideas to fix the new problem?
This feature is not implemented into the GdPicture ActiveX editions.Can I use Gdpicture's pdf functions to do a search for the texts after completing the OCR process? If possible, please show me how. The more detailed the more better. I'd like to build an interface with VB where the user can search the texts in the pdf file.
The implementation is done for the .NET edition and should be released within 2 weeks.
However, we are looking for a way to introduce this feature in the ActiveX editions but today I can't promise a date of release, we encountering too many problem with the PDF engine we are using.
Best regards,
Loïc
Who is online
Users browsing this forum: No registered users and 1 guest