Hi,
Could anyone please list the languages which supports OCR
Thank You,
Harry
OCR Languages
-
- Posts: 352
- Joined: Tue Sep 27, 2011 11:47 am
Re: OCR Languages
Hi,
For V8 supported languages and older, you need to download the language pack:
ttp://www.gdpicture.com/download/ocr_language_pack.zip
There is a list of language that are supported in GdPicture V9:
Arabic language data: ara.traineddata, ara.cube.bigrams, ara.cube.fold, ara.cube.lm, ara.cube.nn, ara.cube.params, ara.cube.size, ara.cube.word-freq
Bulgarian language data: bul.traineddata
Catalan language data: cat.traineddata
Czech language data: ces.traineddata
Chinese (Simplified) language data: chi_sim.traineddata
Chinese (Traditional) language data: chi_tra.traineddata
Cherokee language data: chr.traineddata
Danish language data: dan.traineddata
Danish (Fraktur) language data: dan-frak.traineddata
German language data: deu.traineddata
Fraktur Language data (Old German) : deu-frak.traineddata
Greek language data: ell.traineddata
English language data: eng.traineddata
Finnish language data: fin.traineddata
French language data: fra.traineddata
Hebrew language data: heb.traineddata
Hindi language data: hin.traineddata, hin.cube, hin.cube.fold, hin.cube.lm, hin.cube.nn, hin.cube.params, hin.cube.word-freq, hin.tesseract_cube.nn,
Hungarian language data : hun.traineddata
Indonesian language data: ind.traineddata
Italian language data: ita.traineddata
Japanese language data: jpn.traineddata
Korean language data: kor.traineddata
Latvian language data: lav.traineddata
Lithuanian language data: lit.traineddata
Dutch language data: nld.traineddata
Norwegian language data: nor.traineddata
Polish language data: pol.traineddata
Portuguese language data: por.traineddata
Romanian language data: ron.traineddata
Russian language data: rus.traineddata
Slovakian language data: slk.traineddata
Slovakian Fraktur Language data: slk-frak.traineddata
Slovenian language data: slv.traineddata
Spanish language data: spa.traineddata
Serbian (Latin) language data: srp.traineddata
Swedish language data: swe.traineddata
Swedish (Fraktur) language data: swe-frak.traineddata
Tagalog language data: tgl.traineddata
Thai language data: tha.traineddata
Turkish language data: tur.traineddata
Ukrainian language data: ukr.traineddata
Vietnamese language data: vie.traineddata
Best Regards,
Sami
For V8 supported languages and older, you need to download the language pack:
ttp://www.gdpicture.com/download/ocr_language_pack.zip
There is a list of language that are supported in GdPicture V9:
Arabic language data: ara.traineddata, ara.cube.bigrams, ara.cube.fold, ara.cube.lm, ara.cube.nn, ara.cube.params, ara.cube.size, ara.cube.word-freq
Bulgarian language data: bul.traineddata
Catalan language data: cat.traineddata
Czech language data: ces.traineddata
Chinese (Simplified) language data: chi_sim.traineddata
Chinese (Traditional) language data: chi_tra.traineddata
Cherokee language data: chr.traineddata
Danish language data: dan.traineddata
Danish (Fraktur) language data: dan-frak.traineddata
German language data: deu.traineddata
Fraktur Language data (Old German) : deu-frak.traineddata
Greek language data: ell.traineddata
English language data: eng.traineddata
Finnish language data: fin.traineddata
French language data: fra.traineddata
Hebrew language data: heb.traineddata
Hindi language data: hin.traineddata, hin.cube, hin.cube.fold, hin.cube.lm, hin.cube.nn, hin.cube.params, hin.cube.word-freq, hin.tesseract_cube.nn,
Hungarian language data : hun.traineddata
Indonesian language data: ind.traineddata
Italian language data: ita.traineddata
Japanese language data: jpn.traineddata
Korean language data: kor.traineddata
Latvian language data: lav.traineddata
Lithuanian language data: lit.traineddata
Dutch language data: nld.traineddata
Norwegian language data: nor.traineddata
Polish language data: pol.traineddata
Portuguese language data: por.traineddata
Romanian language data: ron.traineddata
Russian language data: rus.traineddata
Slovakian language data: slk.traineddata
Slovakian Fraktur Language data: slk-frak.traineddata
Slovenian language data: slv.traineddata
Spanish language data: spa.traineddata
Serbian (Latin) language data: srp.traineddata
Swedish language data: swe.traineddata
Swedish (Fraktur) language data: swe-frak.traineddata
Tagalog language data: tgl.traineddata
Thai language data: tha.traineddata
Turkish language data: tur.traineddata
Ukrainian language data: ukr.traineddata
Vietnamese language data: vie.traineddata
Best Regards,
Sami
Re: OCR Languages
Additional information: the latest updated list can be found in the reference guide (starting GdPicture.NET 9) / Appendix / Tesseract OCR Language Dictionaries
see: https://www.gdpicture.com/guides/gdpicture
see: https://www.gdpicture.com/guides/gdpicture
Who is online
Users browsing this forum: No registered users and 1 guest