Search found 3 matches

by benedikt
Tue Sep 18, 2018 3:06 pm
Forum: PDF
Topic: Get text from pdf including spaces in the PDF document
Replies: 3
Views: 3258

Re: Get text from pdf including spaces in the PDF document

As you can read in my post, I'm already using GetPageText with the latest release. But I don't get any spaces larger then one.
So the visual space is not "filled" with spacechars.
by benedikt
Wed Sep 12, 2018 9:15 am
Forum: PDF
Topic: Get text from pdf including spaces in the PDF document
Replies: 3
Views: 3258

Get text from pdf including spaces in the PDF document

Hey,

is it possible to get the formatted text from a PDF? For example a line in the pdf looks like:

Code: Select all

1  Test t3            3,5   14
But the result of "GetPageText" is:

Code: Select all

1 Test t3 3,5 14
I need the space information to split a line into columns.

Thanks a lot!
by benedikt
Wed Aug 08, 2018 12:01 pm
Forum: OCR, MRC & Document understanding
Topic: NullReferenceException when doing PDF OCR
Replies: 11
Views: 16768

Re: NullReferenceException when doing PDF OCR

I've got this issue when disposing the imaging and pdf instance before the ocr process finished. My solution for now was to set the sync option to true: Last parameter here: pdfInstance.OcrPages("*", 0, language, GdPictureHelper.OCRDirectory, "", resolution, 0, true); Complete co...