GetPageText() on PDF/A
GetPageText() on PDF/A
Hello,
I want to get the text of a PDF/A-1b document and the method GetPageText() return specials caracters. The result is not alphanumeric. I have the same problem with the method GetPageContent().
When I try with a simple PDF I don't have this problem.
Thanks for help.
Cécile
I want to get the text of a PDF/A-1b document and the method GetPageText() return specials caracters. The result is not alphanumeric. I have the same problem with the method GetPageContent().
When I try with a simple PDF I don't have this problem.
Thanks for help.
Cécile
Re: GetPageText() on PDF/A
Hi Cecile,
Thank you for your interest in our products.
May I ask you to share the file you are using? I need this file to reproduce the issue.
I'm looking forward to hearing from you.
Regards,
David
Thank you for your interest in our products.
May I ask you to share the file you are using? I need this file to reproduce the issue.
I'm looking forward to hearing from you.
Regards,
David
Re: GetPageText() on PDF/A
Ok, How can I send you the pdf file ?
Re: GetPageText() on PDF/A
Cecile,
You can attach the PDF to this ticket.
Regards,
David
You can attach the PDF to this ticket.
Regards,
David
Re: GetPageText() on PDF/A
ok sorry.
- Attachments
-
- doc.zip
- PDF/A document
- (47.8 KiB) Downloaded 296 times
Re: GetPageText() on PDF/A
Hello Cécile,
Unfortunately this PDF is generated without embedding correct font encoding (aka cMap or difference table). So there is basically nothing that can be done to associate each rendered glyph to the correct character ID. You will obtain the same result with Adobe reader: select the text, copy it (ctrl+c) and past it in notepad.
Please let us know if you need further information.
With best regards,
Loïc
Unfortunately this PDF is generated without embedding correct font encoding (aka cMap or difference table). So there is basically nothing that can be done to associate each rendered glyph to the correct character ID. You will obtain the same result with Adobe reader: select the text, copy it (ctrl+c) and past it in notepad.
Please let us know if you need further information.
With best regards,
Loïc
Re: GetPageText() on PDF/A
ok,
thank you for your help.
Cécile
thank you for your help.
Cécile
Who is online
Users browsing this forum: No registered users and 1 guest