What is a good OCR software for Japanese?
Thread poster: DavidCanek
Could anyone recommend a good software product to OCR a bunch of PDFs in Japanese?
| | DZiW
English to Russian
The problem with hieroglyphs is they usually require 300+ DPI...
So, ABBYY FineReader might be of help. Although I don't use it for Oriental languages , but it proved to be one of the best OCR's atm.
| More OCR options for JA || Feb 9, 2012 |
First, there's 読んde!!ココ v.13. (Windows only.) Here's the basic info:
Buy it here:
Works with TWAIN scanners and WIA scanners, will play nicely all the way to Win 7 64.
It claims to handle smudged kanji and underlined words, and has a learning mode. Plus, it has a bunch of built-in dictionaries to help with recognition. It claims to be able to handle both kana, kanji, and alphanumeric text on the same page as well, something that ReadIris choked on frequently when I used it.
If it does what it claims to, then it would be a heck of a lot better than anything IRIS puts out, for a lot cheaper. ~13,000 yen for the full download version. 20,000 yen if you want a box. Interface is all Japanese.
The other software I'm looking at is e.Typist (Windows only, supports Mac via Boot Camp... I think. It's vague about Mac support.) :
Main Page-- details along the sidebar links:
Try the Eval version here:
Buy the Download version for cheap here:
http://shop.mediadrive.jp/item_list.htm … p;request=
Looks pretty similar to 読んde!!ココ, feature-wise, with a few notable exceptions. First, you can buy the Neo edition which only does EN and JP for 9800 yen (download), or the standard edition which does a bunch of languages for ~13000 yen (download). If you buy at a store, expect to pay 13,000/20,000 yen. Discounts for downloading are nice here, just like 読んde!!ココ. The Neo feature is nice if you don't care about other languages.
Otherwise, it seems to do just about everything that 読んde!!ココ does, with a few exceptions. First, it has a "preview mode" where it superimposes what it thinks it sees over the text it scans, so you can correct it. Also, it doesn't say whether it supports WIA scanners. It's vague about that. It says it supports Win 7 64, but it's kind of sketchy about which scanners it supports. I guess I'll try the eval version first to see if it likes my Brother MFC 7840.
Both handle image files, PDFs, scans, photos, and various input devices, and will output to txt, rtf, excel, word, etc., with some variations between the two. Check the websites to see if your flavors are supported.
Both have large dictionaries, and it looks like both support learning modes for Japanese, which ReadIris does not.
And if you want Free Japanese OCR, there's this thread here:
| | DavidCanek
Local time: 16:06
English to Czech
Thanks for all the tips!
| Best, cheapest OCR software for Japanese || Jul 17, 2012 |
I searched the Internet for several days trying to find a good OCR software for Japanese for MacIntosh (I now have OS X version 10.6.8).
It was very frustrating ... many of the software specifications were totally inadequate for deciding whether the software would do what I needed, which was OCR for scanned PCR files with vertically oriented Japanese, 150 dpi resolution, B/W contrast not the best). The software that looked most promising was available only for Windows.
Finally, I found that I already owned the best (or at least highly adequate) OCR software. It is Abobe Acrobat X Pro (my version is 10.1.3). It rapidly converts whole pages of Japanese text image (PDF scan) to copiable and editable text. It works in vertical orientation (and I assume also horizontal orientation). I've just tested it a bit, and it was 100% accurate. Maybe with further testing it will prove to be somewhat less than 100%, but it was astounding. It does not do translation, but I can copy and paste into translation programs.
I say that this is the cheapest OCR software, because I already had Abobe Acrobat X Pro. I needed it for other things, and finding that it had OCR capability was essentially free, in my case. However, Abobe Acrobat X Pro is probably cheaper anyway than equally good PCR conversion software.
Here is an excellent link telling you exactly how to use the OCR converter function in Abobe Acrobat X Pro:
Here is the translation software I use ("detailed word info" option):
I hope I have helped people who want OCR for Japanese.
To report site rules violations or get help, contact a site moderator:
You can also contact site staff by submitting a support request »
What is a good OCR software for Japanese?
|Protemos translation business management system |
|Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!|
The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.
More info »
|Translation Memory Software for Any Platform|
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
More info »