Handling PDF files
Thread poster: aruna yallapragada
aruna yallapragada  Identity Verified
Local time: 18:27
Member (2008)
German to English
+ ...
Jul 7, 2007

I find it difficult handling documents that have been scanned

as images. What is the best / fastest / easisest way to

convert them into editable word files? Is there any software

that does this conversion?


Direct link Reply with quote
 

Andrej  Identity Verified
Local time: 15:57
Member (2005)
German to Russian
+ ...
- Jul 7, 2007

ABBY Finereader

Direct link Reply with quote
 

Evija Rimšāne  Identity Verified
Latvia
Local time: 15:57
Member
English to Latvian
... Jul 7, 2007

I believe this topic has been discussed here many many times. Please try to search the forums.


Direct link Reply with quote
 
aruna yallapragada  Identity Verified
Local time: 18:27
Member (2008)
German to English
+ ...
TOPIC STARTER
thank you Jul 7, 2007

Thank you. I did go through the forum. But it was not of
much help. I do have ABBY Finereader. Maybe I just don't knoiw how to go about it. I will try again.


Direct link Reply with quote
 

Martin Wenzel
Germany
Local time: 14:57
English to German
+ ...
Omnipage15 Jul 7, 2007

I got a scanned in document just yesterday and Omnipage did a pretty good job.


It still leaves me with text boxes, but as long as I can trados inside these, I don't mind.


Aruna, you need to fiddle around with this progam and try various ways till you get the required results.

So don't give up on ABBYY Finereader too easily. I don't know it, but many translators say, it's a good program.


Direct link Reply with quote
 
xxxBrandis
Local time: 14:57
English to German
+ ...
a possible work around Jul 7, 2007

Halo!
Convert images using acrobat reader 8.0, take a print, rescan and then do the OCR. The editing time involved is lot lesser and you get more clear processable output. Well this applies only if the source file is a picture like a printed brochure converted into an image using scanning process. Such images cannot be effectively OCRed with any known OCR software effectively, there is still much manual editing work involved before getting a satisfactorily processable file in word. Brandis

[Edited at 2007-07-07 16:15]


Direct link Reply with quote
 

Natalie  Identity Verified
Poland
Local time: 14:57
Member (2002)
English to Russian
+ ...

MODERATOR
Hi Brandis, I disagree with your suggestion Jul 7, 2007

Images are opened directly in ABBY Finereader and converted to the text. Why on earth converting, printing, rescanning?

Direct link Reply with quote
 

VIBOL KEO  Identity Verified
Local time: 19:57
Member (2009)
English to Khmer (Central)
PDF, Scanned, and Conversion?! Jul 9, 2007

Ladies and Gentlemen:

Trying TextBridge and/or pdf2word softwares may be helful.

Good Luck!

Vibol Keo

[Edited at 2007-07-09 11:50]


Direct link Reply with quote
 
Miroslav Jeftic  Identity Verified
Local time: 14:57
English to Serbian
+ ...
Anything better than Abbyy? Sep 19, 2007

Can anyone suggest better solution than Abbyy? I'm using it for quite some time now, but it gave me headache more than once. For examaple, if I have tables with pictures inside, I find it impossible to have the correct output directly. Drawing a picture block inside table block never produces picture in the output. Usually I have to first convert table without picture, then picture without table, then insert picture into document with the table. When I have a small document that's just a minor thing, but recently I had two pdf files with totally around 700 pages, so it was nigthmare now and again. Also if the text blocks are too big, sometimes bullets and such things disappear from the output, though they were displayed correctly in the text window. Messed up fonts and strange tabulations too are also a bother.
However, I haven't found better option yet, those smaller convertors are completely useless most of the time, and Omnipage didn't give me enough editing options. Abbyy's result I can at least correct manually.


Direct link Reply with quote
 

Ramon Somoza  Identity Verified
Spain
Local time: 14:57
Member (2002)
Dutch to Spanish
+ ...
Depends on what you want.... Sep 22, 2007

Abby Finereader is a mess to use, but is invaluable if the PDF text has been scanned or faxed and is therefore stored within the PDF as an image, due to its built-in OCR.

Alternatively, if the PDF text is actually stored as text (you can check by trying to select the text in the PDF file), a much better conversion tool is Solid Converter (http://www.solidpdf.com/). It is not very expensive in its standard version and works like a breeze, maintaining the layout much better than Abby Finereader once you know how it works. It does not convert text stored as an image to text, however. Dowload a trial version from their web site and test it yourself.


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Handling PDF files

Advanced search






SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »
Across v6.3
Translation Toolkit and Sales Potential under One Roof

Apart from features that enable you to translate more efficiently, the new Across Translator Edition v6.3 comprises your crossMarket membership. The new online network for Across users assists you in exploring new sales potential and generating revenue.

More info »



All of ProZ.com
  • All of ProZ.com
  • Term search
  • Jobs