ProZ.com global directory of translation services
 The translation workplace
Ideas

 
User
Thread poster: Jan Sundström
Optimize scanned PDF - clever new function in Acrobat 8!

Jan Sundström
Sweden
Local time: 02:25
English to Swedish
+ ...
Jan 3, 2008

Hi all,

I wonder if you discovered this hidden gem in Acrobat 8 yet:
Document > Optimize scanned PDF

It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.
An added benefit is that the file size shrinks drastically (my test PDF shrunk from 14MB to 800KB).

Did anyone else play around with the settings? Did your results improve?

/Jan


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 02:25
Member (2006)
English to Afrikaans
+ ...
Ideally... Jan 3, 2008


J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


Direct link Reply with quote
 

Andrzej Lejman  Identity Verified
Poland
Local time: 02:25
Member (2004)
German to Polish
+ ...
Solution Jan 3, 2008


Samuel Murray wrote:


J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


You can always optimise a copy of the document...

Regards

A.


Direct link Reply with quote
 

Jan Sundström
Sweden
Local time: 02:25
English to Swedish
+ ...
TOPIC STARTER
Exactly... Jan 3, 2008


Andrzej Lejman wrote:


Samuel Murray wrote:


J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


You can always optimise a copy of the document...



Still, I found the options in Acrobat 8 to be very versatile!

You can select and customize for background images, anti-moiré effect and several other levers where you can adjust the strength of the effect too. I haven't seen this in any other OCR program so far.

/Jan


Direct link Reply with quote
 
Noe Tessmann  Identity Verified
Germany
Local time: 02:25
English to German
+ ...
missng bits and pieces Jan 10, 2008

oh it must be this feature. I scanned into Adobe and ended up with missing words in the text. Is this optimizing?

Regards

Noe


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Natalie[Call to this topic]
Mohamed Kamel[Call to this topic]

You can also contact site staff by submitting a support request »

Optimize scanned PDF - clever new function in Acrobat 8!






Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »
Fluency Translation Suite 2011
Translate Up To 50% Faster with Fluency

Start and finish your translations faster than ever with Fluency Translation Suite 2011. TMs, Terminology, and Online Resources are all fully integrated and only a click away. Download a free trial today!

More info »