How to count words in a scanned text
Thread poster: Albana Dhimitri

Albana Dhimitri  Identity Verified
Greece
Local time: 09:01
Greek to Albanian
+ ...
Dec 2, 2009

Hi everybody,

I am negotiating the translation of several PDF files, so I first have to count the number of words.

I can convert the PDF files into .doc files, but that doesn't seem to be of any use: the documents are scanned, and Word doesn't seem to count words in such documents.

Any ideas on what a solution could be?

Special thanks to any colleagues who can share their experience and have a productive day!



Albana Dhimitri  Identity Verified
Greece
Local time: 09:01
Greek to Albanian
+ ...
TOPIC STARTER
Adding to... Dec 2, 2009

Can scanned texts be opened (and translated) in Trados (the 2007 version)?


Epameinondas Soufleros  Identity Verified
Greece
Local time: 09:01
Member (2008)
English to Greek
+ ...
Manual count Dec 2, 2009

You can count them "manually". After all, this is one of the reasons you should charge extra for such texts.

As for translating them, you need some form of OCR, but then again, the results are terrible and you need to spend a lot of time fixing things in the resulting document, so you'd be better off translating the text by hand and typing it in Word. Again, this is why you should charge extra. And if they need the layout of the original replicated, you can always charge extra for DTP services.

This is the way it goes: rubbish in, rubbish out. If they don't want the "rubbish out" part of the equation, they'll have to pay extra.



Valery Kaminski  Identity Verified
Belarus
Local time: 10:01
Member (2006)
English to Russian
+ ...
Fine Reader Dec 2, 2009

FineReader will run OCR on the scanned PDF, and then you can convert the result to Word.


Derek Gill Franßen  Identity Verified
Germany
Local time: 08:01
German to English
+ ...
OCR Dec 2, 2009

Get acquainted with OCR software: http://en.wikipedia.org/wiki/Optical_character_recognition (there is also a list of such programs toward the end of the article).

Here are the steps involved:
1. Use OCR software to convert the image (scan) into a document;
2. Count the words as you are accustomed to, e.g., with Word Count in Word.
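If you would rather not go through Word for step 2, the count can also be sketched in a few lines of Python once the OCR tool has exported plain text. This is only a rough sketch: `count_words` and `count_words_in_file` are hypothetical helper names, the OCR step itself is assumed to have been done already in whatever program you use, and the rule for skipping stray marks is an assumption, so the total may differ slightly from what Word reports.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens that contain a letter or digit.

    Tokens made up only of punctuation (a common OCR artifact, e.g. a
    stray "|" or "-") are skipped. That filter is an assumption of this
    sketch; other word counters may include such tokens.
    """
    tokens = text.split()
    return sum(1 for token in tokens if re.search(r"\w", token))


def count_words_in_file(path: str, encoding: str = "utf-8") -> int:
    """Count words in a plain-text file exported from an OCR program."""
    with open(path, encoding=encoding) as handle:
        return count_words(handle.read())


# Sample standing in for OCR output, including stray marks.
sample = "Καλημέρα σας ,  this is   OCR output - with stray | marks."
print(count_words(sample))
```

For a real job you would call `count_words_in_file` on each exported text file and add up the totals.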




Albana Dhimitri  Identity Verified
Greece
Local time: 09:01
Greek to Albanian
+ ...
TOPIC STARTER
Thank you for helping! Dec 2, 2009

Hi Epameinondas, Valery and Derek,

I really appreciate it! Thank you for the advice. I will study OCR tools and how to use them, and I hope it helps.



VIBOL KEO  Identity Verified
Local time: 14:01
Member (2009)
English to Khmer (Central)
TextBridge... Dec 2, 2009

Well, I used to watch an expert I worked closely with use TextBridge for both numerical data and narrative text... I have often used it myself...

I think it would be helpful if you tried it; then you can count the words in one of the MS Office programs...


Great and Green Day...!



Jutta Deichselberger  Identity Verified
Local time: 08:01
Member (2004)
French to German
+ ...
Anycount Dec 2, 2009

Anycount 7 is able to count PDFs, too!


Erik Freitag  Identity Verified
Germany
Local time: 08:01
Member (2006)
Dutch to German
+ ...
No, it can't (AFAIK) Dec 2, 2009

Jutta Deichselberger wrote:

Anycount 7 is able to count PDFs, too!


But not if they're scanned, which is what the OP is asking for.

Edit: Apologies, I didn't know Anycount can do OCR now.

[Edited at 2009-12-02 16:38 GMT]



Jutta Deichselberger  Identity Verified
Local time: 08:01
Member (2004)
French to German
+ ...
Yes, it can:-) Dec 2, 2009

That is the reason why I upgraded from version 6 to version 7.
I have already tested it several times; it seems to count slightly fewer words, maybe 5% fewer...
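If the tool really does count a little low, a quote can be padded accordingly. The arithmetic below is only a sketch: the 5% figure is the rough impression given above, not a measured value, and `estimate_true_count` is a hypothetical helper name. It assumes the tool reports (1 - rate) of the real total, so the estimate is the reported count divided by (1 - rate).

```python
def estimate_true_count(tool_count: int, undercount_rate: float = 0.05) -> int:
    """Estimate the real word count from a count believed to be too low.

    Assumes the tool reports (1 - undercount_rate) of the true total.
    The default rate of 0.05 is a rough guess, not a measured figure.
    """
    if not 0 <= undercount_rate < 1:
        raise ValueError("undercount_rate must be in the range [0, 1)")
    return round(tool_count / (1 - undercount_rate))


# A reported count of 9,500 words with an assumed 5% undercount.
print(estimate_true_count(9500))
```

Checking the rate on a page or two counted by hand before relying on it would be safer than taking 5% on faith.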



Jutta Deichselberger  Identity Verified
Local time: 08:01
Member (2004)
French to German
+ ...
@ efreitag Dec 2, 2009

However, I do not know whether the PDF files were scanned or not...:-((

Volodymyr Kukharenko
Ukraine
Local time: 09:01
Member (2009)
English to Ukrainian
+ ...
Anycount 7 can count text in images Dec 2, 2009

efreitag wrote:

Jutta Deichselberger wrote:

Anycount 7 is able to count PDFs, too!


But not if they're scanned, which is what the OP is asking for.


Yes, it can, if you save the scanned PDF as a JPEG. Anycount 7 has OCR, so it can count the words in image files.



