Urgent!!!! Pdf convert to word or txt. Chinese character support?
Thread poster: Yuting Xiao

Yuting Xiao  Identity Verified
United Kingdom
Local time: 19:25
English to Chinese
+ ...
Apr 10, 2007

I downloaded ABBYY FineRead, hoping it can easily convert my scanned pdf files into word or some sort.
But it turned out I need to have the extended package support ifor Chinese lanuage recognisation.....and the extended package is only supplied to company users by ABBYY. obviously I am not a company user.
Anyone has any suggestions? or any other tools complying with Chinese?
Thanks in advance.


Mulyadi Subali  Identity Verified
Local time: 02:25
English to Indonesian
+ ...
Easy PDF to Text Converter Apr 11, 2007

i don't know whether this supports chinese character or not, but it's free and so far it's been pretty good. the info page is here: http://www.pdf-to-html-word.com/pdf-to-text/


Katrin Hollberg  Identity Verified
Local time: 20:25
Japanese to German
+ ...
Check out this software (works fine with Japanese documents) Apr 11, 2007


I can really really recommend this solution.
I just checked out the trial version and it works fine with pdfs created in the Japanese language. As I noticed this software also supports Chinese.

(You can just download from their site and use for several pages and then decide whether it is convenient for you)


Have a try...it is really nice and cheap




Yuting Xiao  Identity Verified
United Kingdom
Local time: 19:25
English to Chinese
+ ...
Thanks all! Apr 12, 2007

I finally sorted it out yesterday using a chinese tool for the scanned pdf recognition. I tried those two tools recommended by Katrin and Mulyadi first but turned out they were not working for the scanned filesicon_frown.gif.
Anyway, thanks for both of you.


Katrin Hollberg  Identity Verified
Local time: 20:25
Japanese to German
+ ...
What a pity, Rachel... Apr 12, 2007

Sorry for not being helpful but of course it does not work with pdfs which have not been converted from a standard text file (e.g. Word...).

I also tried a pdf file the other day which finally turned out originally being a bmp file converted into pdf. In that case I guess no converting tool is able to retrieve any information out of a picture file because all relating information data gets lost during the processes.

In those cases probably only a good OCR-software is finally helpful in order to prevent you from typing a document manually for a translation process with Trados and other tools.

I just found one for Japanese characters ("YONDE! Koko") and have in mind to buy very soon. I am sure there must be a similar one for Chinese characters. Actually it is the same fonts' nature.

I would try asking the customer whether they could not send you an original text file. If I face any troubles with those "originally produced" files I can also use my Japanese OS + Office and fix any compatibility problems in most cases. But not every agency is aware of these possibilities - depending on their general experience with Chinese/Japanese fonts.

Nevertheless - Good luck for your project


[Bearbeitet am 2007-04-12 08:27]


Jan Sundström  Identity Verified
Local time: 20:25
English to Swedish
+ ...
Other OCR programs Apr 16, 2007

Hi all,

The sticking pont seems to be finding an OCR program that has all these features:
- can OCR a flattened PDF, or unlayered image file like JPG, TIFF
- preferably comes with a western UI (for easy handling by non-chinese users)
- is available unbundled for individual users at a reasonable price

Search the forum for previous suggestions. The concensus seems to be that the best OCR for Chinese is Hanwang. But I still don't know if it has any English UI!
Product page here (it seems that it's now available standalone too, not just bundled):

Mentioned here:

"Hanwang (汉王), Shangshu (尚书) and 清华紫光 are top ones in the industry. Where Hanwang is bundled solely with Hanwang Scanners, whereas Shangshu is not scanner-dependant, but sold as a bundle with Microtek series. 清华紫光s are for Qinghua series.

Amongst them I like Hanwang best. Their website:
Hanwang: http://www.hw99.com/

Shangshu is also part of Hanwang, with limited features."

Other suggestions:


convert pdf (chinese words) into text Apr 29, 2007

hello to rachel and all,

Which chinese tool can be used for the scanned pdf recognition? and how to used it, can teach? i do scanned my chinese document but cant convert it into text, it appear as a image. meant i need to retype whole document!! anyone can help please? tq in advanced.


Angeline PhD  Identity Verified
Local time: 03:25
English to Chinese
+ ...
Try this. May 27, 2007


For Chinese character, It is better than other ocr software.

display output error Aug 25, 2012

Hi, tried Hanwang, it able to recognised, but when convert to text or words, it shown simbol instead of chinese charactor. Anybody encounter this? What setting should I do?


To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

Urgent!!!! Pdf convert to word or txt. Chinese character support?

Advanced search

BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »

  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search