How to count words in a scanned text
Thread poster: Albana Dhimitri
Albana Dhimitri
Greece
Greek to Albanian
Dec 2, 2009

Hi everybody,

I am negotiating the translation of several PDF files, so I first have to count the number of words.

I can convert the PDF files into .doc files, but it seems that won't be of any use: the documents are scanned, and Word doesn't seem to count words in such documents.

Any ideas on what a solution could be?

Special thanks to any colleagues who can share their experience, and have a productive day!


 
Albana Dhimitri
Greece
Greek to Albanian
TOPIC STARTER
Adding to... Dec 2, 2009

Can scanned texts be opened (and translated) in Trados (the 2007 version)?

 
Epameinondas Soufleros
Greece
Member (2008)
English to Greek
Manual count Dec 2, 2009

You can count them "manually". After all, this is one of the reasons you should charge extra for such texts.

As for translating them, you need some form of OCR, but the results are often terrible and you have to spend a lot of time fixing the resulting document, so you'd be better off translating the text by hand and then typing it in Word. Again, this is why you should charge extra. And if they need the layout of the original replicated, you can always charge extra for DTP services.

This is the way it goes: rubbish in, rubbish out. If they don't want the "rubbish out" part of the equation, they'll have to pay extra.


 
Valery Kaminski
Belarus
English to Russian
Fine Reader Dec 2, 2009

FineReader will recognize the text in the scanned PDF, and then you can convert it to Word.

 
Derek Gill Franßen
Germany
German to English
In memoriam
OCR Dec 2, 2009

Get acquainted with OCR software: http://en.wikipedia.org/wiki/Optical_character_recognition (there is also a list of such programs toward the end of the article).

Here are the steps involved:
1. Use OCR software to convert the image (scan) into a text document;
2. Count the words as you normally would, e.g., with Word Count in Word.
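
If you would rather not paste the OCR output into Word, step 2 can also be approximated with a few lines of Python. This is only a rough sketch: it assumes you have already exported the OCR result as plain text, the function name count_words is made up for illustration, and the rules it applies (rejoining words hyphenated across line breaks, ignoring stray marks such as "|") are my own assumptions, so the total may differ slightly from what Word reports.

```python
import re


def count_words(ocr_text: str) -> int:
    """Rough word count for plain text exported from an OCR program."""
    # Rejoin words that the scan split across lines with a hyphen,
    # e.g. "transla-" at the end of one line and "tion" on the next.
    joined = re.sub(r"(\w)-[ \t]*\r?\n[ \t]*(\w)", r"\1\2", ocr_text)
    # Split on whitespace and keep only tokens that contain at least one
    # letter or digit, so stray OCR marks like "|" or "~" are not counted.
    return sum(1 for token in joined.split() if re.search(r"[^\W_]", token))


if __name__ == "__main__":
    sample = "How to count words in a scan-\nned text | ~"
    print(count_words(sample))
```

The pattern works on Unicode letters, so Greek or Cyrillic text is counted the same way as Latin text.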



 
Albana Dhimitri
Greece
Greek to Albanian
TOPIC STARTER
Thank you for helping! Dec 2, 2009

Hi Epameinondas, Valery and Derek,

I really appreciate it! Thank you for the advice. I will study OCR tools and their use, and I hope it helps.


 
VIBOL KEO
Member (2009)
English to Khmer (Central)
TextBridge... Dec 2, 2009

Well, a specialist who worked closely with me used TextBridge for both numerical data and narrative text, and I have often used it myself.

I think it would be helpful to try it; you can then count the words in one of the MS Office applications.


Have a great and green day!


 
Jutta Deichselberger
French to German
Anycount Dec 2, 2009

Anycount 7 can count PDFs, too!

 
Erik Freitag
Germany
Member (2006)
Dutch to German
No, it can't (AFAIK) Dec 2, 2009

Jutta Deichselberger wrote:

Anycount 7 can count PDFs, too!


But not if they're scanned, which is what the OP is asking for.

Edit: Apologies, I didn't know Anycount can do OCR now.

[Edited at 2009-12-02 16:38 GMT]


 
Jutta Deichselberger
French to German
Yes, it can:-) Dec 2, 2009

That is the reason why I upgraded from version 6 to version 7.
I have already tested it several times; it seems to count a little less, maybe 5%...
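
If the tool really does come in roughly 5% low, the figure can be corrected before quoting. The sketch below is just arithmetic under that assumption: the 5% rate is the rough impression mentioned above, not a measured value, and the function name estimate_true_count is invented for illustration.

```python
def estimate_true_count(reported: int, undercount_rate: float = 0.05) -> int:
    """Estimate the real word count from a count that is believed to be too low.

    If the tool misses a fraction `undercount_rate` of the words, then
    reported = true * (1 - undercount_rate), so true = reported / (1 - rate).
    """
    if not 0 <= undercount_rate < 1:
        raise ValueError("undercount_rate must be in the range [0, 1)")
    return round(reported / (1 - undercount_rate))


if __name__ == "__main__":
    # A reported 9,500 words at an assumed 5% undercount.
    print(estimate_true_count(9500))
```

It is worth checking the rate on a page or two counted by hand before relying on it for a quote.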


 
Jutta Deichselberger
French to German
@ efreitag Dec 2, 2009

However, I do not know whether the PDF files were scanned or not...:-((

 
Volodymyr Kukharenko
Ukraine
English to Ukrainian
Anycount 7 can count text in images Dec 2, 2009

efreitag wrote:

Jutta Deichselberger wrote:

Anycount 7 can count PDFs, too!


But not if they're scanned, which is what the OP is asking for.


Yes, it can, if you save the scanned PDF as a JPEG. Anycount 7 has OCR, so it can count words in image files.


 

