Optimize scanned PDF - clever new function in Acrobat 8!
Thread poster: Jan Sundström

Jan Sundström  Identity Verified
Sweden
Local time: 02:45
English to Swedish
+ ...
Jan 3, 2008

Hi all,

I wonder if you discovered this hidden gem in Acrobat 8 yet:
Document > Optimize scanned PDF

It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.
An added benefit is that the file size shrinks drastically (my test PDF shrunk from 14MB to 800KB).

Did anyone else play around with the settings? Did your results improve?

/Jan


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 02:45
Member (2006)
English to Afrikaans
+ ...
Ideally... Jan 3, 2008

J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


Direct link Reply with quote
 

Andrzej Lejman  Identity Verified
Local time: 02:45
German to Polish
+ ...
Solution Jan 3, 2008

Samuel Murray wrote:

J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


You can always optimise a copy of the document...

Regards

A.


Direct link Reply with quote
 

Jan Sundström  Identity Verified
Sweden
Local time: 02:45
English to Swedish
+ ...
TOPIC STARTER
Exactly... Jan 3, 2008

Andrzej Lejman wrote:

Samuel Murray wrote:

J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.


Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.


You can always optimise a copy of the document...



Still, I found the options in Acrobat 8 to be very versatile!

You can select and customize for background images, anti-moiré effect and several other levers where you can adjust the strength of the effect too. I haven't seen this in any other OCR program so far.

/Jan


Direct link Reply with quote
 
Noe Tessmann  Identity Verified
Local time: 02:45
English to German
+ ...
missng bits and pieces Jan 10, 2008

oh it must be this feature. I scanned into Adobe and ended up with missing words in the text. Is this optimizing?

Regards

Noe


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Optimize scanned PDF - clever new function in Acrobat 8!

Advanced search






Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »
PerfectIt consistency checker
Faster Checking, Greater Accuracy

PerfectIt helps deliver error-free documents. It improves consistency, ensures quality and helps to enforce style guides. It’s a powerful tool for pro users, and comes with the assurance of a 30-day money back guarantee.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search