| User | Thread poster: Jan Sundström Optimize scanned PDF - clever new function in Acrobat 8! |
Jan Sundström Sweden Local time: 02:25 English to Swedish + ... |
Hi all,
I wonder if you discovered this hidden gem in Acrobat 8 yet:
Document > Optimize scanned PDF
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY.
An added benefit is that the file size shrinks drastically (my test PDF shrunk from 14MB to 800KB).
Did anyone else play around with the settings? Did your results improve?
/Jan | | | |
Samuel Murray Netherlands Local time: 02:25
Member (2006) English to Afrikaans + ... |
J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY. |
|
Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove. | | | |
Andrzej Lejman Poland Local time: 02:25
 Member (2004) German to Polish + ... |
Samuel Murray wrote:
J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY. |
|
Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.
|
|
You can always optimise a copy of the document...
Regards
A. | | | |
Jan Sundström Sweden Local time: 02:25 English to Swedish + ... TOPIC STARTER |
Andrzej Lejman wrote:
Samuel Murray wrote:
J-a-n S-ndstr-m wrote:
It cleans up the dirt/noise and prepares the text for OCR with an external application like ABBYY. |
|
Ideally, your OCR program should also be configurable to ignore a certain amount of noise. The potential problem with Adobe doing the optimisation is that you can't "deoptimise" the PDF when you discover that Adobe mis-guessed which bits to remove.
|
|
You can always optimise a copy of the document...
|
|
Still, I found the options in Acrobat 8 to be very versatile!
You can select and customize for background images, anti-moiré effect and several other levers where you can adjust the strength of the effect too. I haven't seen this in any other OCR program so far.
/Jan | | | |
Noe Tessmann Germany Local time: 02:25 English to German + ... | | missng bits and pieces | Jan 10, 2008 |
oh it must be this feature. I scanned into Adobe and ended up with missing words in the text. Is this optimizing?
Regards
Noe | | | |