Handling PDF files
Thread poster: aruna yallapragada

aruna yallapragada  Identity Verified
Local time: 11:43
Member (2008)
German to English
+ ...
Jul 7, 2007

I find it difficult to handle documents that have been scanned as images. What is the best / fastest / easiest way to convert them into editable Word files? Is there any software that does this conversion?


 

Andrej  Identity Verified
Local time: 09:13
Member (2005)
German to Russian
+ ...
- Jul 7, 2007

ABBYY FineReader

 

Evija Rimšāne  Identity Verified
Latvia
Local time: 09:13
English to Latvian
... Jul 7, 2007

I believe this topic has been discussed here many, many times. Please try searching the forums.


 

aruna yallapragada  Identity Verified
Local time: 11:43
Member (2008)
German to English
+ ...
TOPIC STARTER
thank you Jul 7, 2007

Thank you. I did go through the forum, but it was not of much help. I do have ABBYY FineReader. Maybe I just don't know how to go about it. I will try again.


 

Martin Wenzel
Germany
Local time: 08:13
English to German
+ ...
Omnipage15 Jul 7, 2007

I got a scanned document just yesterday and OmniPage did a pretty good job.

It still leaves me with text boxes, but as long as I can work in Trados inside these, I don't mind.

Aruna, you need to fiddle around with this program and try various approaches until you get the required results.

So don't give up on ABBYY FineReader too easily. I don't know it, but many translators say it's a good program.


 

xxxBrandis
Local time: 08:13
English to German
+ ...
a possible work around Jul 7, 2007

Hello!
Convert the images using Acrobat Reader 8.0, print them, rescan the printout, and then run the OCR. The editing time involved is much lower and you get cleaner, more processable output. This applies only if the source file is a picture, such as a printed brochure converted into an image by scanning. Such images cannot be OCRed effectively with any known OCR software; there is still a lot of manual editing involved before you get a satisfactorily processable file in Word. Brandis

[Edited at 2007-07-07 16:15]


 

Natalie  Identity Verified
Poland
Local time: 08:13
Member (2002)
English to Russian
+ ...

MODERATOR
Hi Brandis, I disagree with your suggestion Jul 7, 2007

Images can be opened directly in ABBYY FineReader and converted to text. Why on earth convert, print, and rescan?

 

VIBOL KEO  Identity Verified
Local time: 13:13
Member (2009)
English to Khmer (Central)
PDF, Scanned, and Conversion?! Jul 9, 2007

Ladies and Gentlemen:

Trying TextBridge and/or pdf2word software may be helpful.

Good Luck!

Vibol Keo

[Edited at 2007-07-09 11:50]


 

Miroslav Jeftic  Identity Verified
Local time: 08:13
English to Serbian
+ ...
Anything better than Abbyy? Sep 19, 2007

Can anyone suggest a better solution than ABBYY? I've been using it for quite some time now, but it has given me a headache more than once. For example, if I have tables with pictures inside, I find it impossible to get the correct output directly. Drawing a picture block inside a table block never produces a picture in the output. Usually I have to first convert the table without the picture, then the picture without the table, and then insert the picture into the document with the table. With a small document that's just a minor thing, but recently I had two PDF files totalling around 700 pages, so it was a nightmare time and again. Also, if the text blocks are too big, bullets and similar items sometimes disappear from the output, even though they were displayed correctly in the text window. Messed-up fonts and strange tabulation are also a bother.
However, I haven't found a better option yet. The smaller converters are completely useless most of the time, and OmniPage didn't give me enough editing options. ABBYY's result I can at least correct manually.


 

Ramon Somoza  Identity Verified
Spain
Local time: 08:13
Member (2002)
Dutch to Spanish
+ ...
Depends on what you want.... Sep 22, 2007

ABBYY FineReader is a mess to use, but thanks to its built-in OCR it is invaluable if the PDF text has been scanned or faxed and is therefore stored within the PDF as an image.

Alternatively, if the PDF text is actually stored as text (you can check by trying to select the text in the PDF file), a much better conversion tool is Solid Converter (http://www.solidpdf.com/). It is not very expensive in its standard version and works like a breeze, maintaining the layout much better than ABBYY FineReader once you know how it works. It does not convert text stored as an image to text, however. Download a trial version from their web site and test it yourself.
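
For anyone who has a whole batch of PDFs to sort, the "is it text or is it an image?" check can be roughly automated. What follows is only a sketch and not something any of the tools mentioned here do: it assumes that a PDF with real (selectable) text references at least one /Font resource, while a pure scan usually does not. The function names has_font_resources and suggest_route are made up for illustration, and the heuristic can be fooled (for example by encrypted files), so selecting text by hand remains the reliable test.

```python
import re
import zlib

# Matches the raw bytes between "stream" and "endstream" in a PDF file.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\nendstream", re.DOTALL)


def has_font_resources(pdf_bytes: bytes) -> bool:
    """Heuristic (assumption): True if the PDF mentions a /Font resource."""
    if b"/Font" in pdf_bytes:
        return True
    # Newer PDFs may pack their dictionaries into Flate-compressed streams,
    # so try to inflate each stream and look inside it as well.
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            data = zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate data (e.g. a JPEG scan); skip it
        if b"/Font" in data:
            return True
    return False


def suggest_route(pdf_bytes: bytes) -> str:
    """Hypothetical helper: pick the workflow described in this thread."""
    if has_font_resources(pdf_bytes):
        return "text-based: try a direct PDF-to-Word converter"
    return "image-only: run OCR first"
```

To use it, read a file with open(path, "rb").read() and pass the bytes to suggest_route.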


 

