Getting started with converted PDF
Thread poster: Andy S
Andy S
Andy S
United Kingdom
Local time: 17:16
German to English
Nov 26, 2008

I cannot even get off the ground with my project as there is no file showing up in OmegaT when I try to load the project.

I converted a PDF to Word and then to .ODF. However it still looks like a PDF in OO Writer and Word, if you see what I mean - it is not editable.

I suspect this is the reason for the problem - I understand there are different types of PDFs, some so-called "image" files, or some such, which are not editable and therefore no good for OmegaT.

... See more
I cannot even get off the ground with my project as there is no file showing up in OmegaT when I try to load the project.

I converted a PDF to Word and then to .ODF. However it still looks like a PDF in OO Writer and Word, if you see what I mean - it is not editable.

I suspect this is the reason for the problem - I understand there are different types of PDFs, some so-called "image" files, or some such, which are not editable and therefore no good for OmegaT.

Could that be the case here, please?

[Edited at 2008-11-26 17:03 GMT]
Collapse


 
Ivana Zuppa-Baksa
Ivana Zuppa-Baksa
Croatia
Local time: 18:16
German to Croatian
+ ...
In memoriam
a good converter Nov 26, 2008

Dear Andrew,

I have got a good converter I found on Internet - it can be used for free, with a little inconvenience to calculate a mathematical problem of type (34 + (21 x 34) - 354) if you do not want to pay 15$ for your registration.
Try to covert your pdf with it! I settled all problems of pdfs due to this little thing!

http://www.hellopdf.com/index.php

B
... See more
Dear Andrew,

I have got a good converter I found on Internet - it can be used for free, with a little inconvenience to calculate a mathematical problem of type (34 + (21 x 34) - 354) if you do not want to pay 15$ for your registration.
Try to covert your pdf with it! I settled all problems of pdfs due to this little thing!

http://www.hellopdf.com/index.php

Best regards, Ivana Zuppa Bakša
Collapse


 
Dragomir Kovacevic
Dragomir Kovacevic  Identity Verified
Italy
Local time: 18:16
Italian to Serbian
+ ...
"However it still looks like a PDF in OO Writer" Nov 26, 2008

Andrew Stucken wrote:

I cannot even get off the ground with my project as there is no file showing up in OmegaT when I try to load the project.

I converted a PDF to Word and then to .ODF. However it still looks like a PDF in OO Writer and Word, if you see what I mean - it is not editbale.

I suspect this is the reason for the problem - I undersatand there are different types of PDFs, some so-called "image" files, or some such, which are not editable and therefore no good for OmegaT.

Could that be the case here, please?


hello andrew,

a good method i use for PDFs, if it is possible - is to convert pdfs into images. I use a simple and very useful PDFcreator: http://sourceforge.net/projects/pdfcreator/. with it you can save pdfs into various image formats. i use it to create pdfs from any other text file (it installs as one of the printers). it is a piece of software to hold in your pc.

after obtaining, let's say, a good tif packbits compressed image out of pdf, i use ocr on it.


another method for you, now, is to give it an another try, in case you use PlusTools which comes as a free supplement of Wf. Open pdf, open Word, open PlusTools, and convert pdf through the inner commands. you'll obtain a very discrete plain text, formatted in the sense of differentiated fonts. document graphics, of course is gone, but you neednot respect the extra graphics if you are already under a yoke of pdf format.

salud, d


 
Heinrich Pesch
Heinrich Pesch  Identity Verified
Finland
Local time: 19:16
Member (2003)
Finnish to German
+ ...
You must scan Nov 26, 2008

Most translators use Abbyy Finereader. It reads the text and creates a Word-doc which you can translate. There are also other ocr-programs on the market.
Regards
Heinrich


 
Susan Welsh
Susan Welsh  Identity Verified
United States
Local time: 12:16
Russian to English
+ ...
PDF conversion Nov 26, 2008

There's an article on this subject in the ProZ articles archive.
I know that if you have a non-extractable PDF (just an image), you cannot convert it to Word or txt or anything, without an OCR scanner.
If it's an extractable file, you can convert it to .txt using Adobe Reader itself (just "save as text"). Of course, all formatting will be lost. None of the software I have used to convert it to Word rather than .txt ever worked.

Hopefully some of the suggestions offered
... See more
There's an article on this subject in the ProZ articles archive.
I know that if you have a non-extractable PDF (just an image), you cannot convert it to Word or txt or anything, without an OCR scanner.
If it's an extractable file, you can convert it to .txt using Adobe Reader itself (just "save as text"). Of course, all formatting will be lost. None of the software I have used to convert it to Word rather than .txt ever worked.

Hopefully some of the suggestions offered by others will be helpful, but I have tried a lot of suggestions that have been made in past months, and for one reason or another, none of them worked on my "problem" PDFs. It's an annoying problem.

Good luck.
Collapse


 
Andy S
Andy S
United Kingdom
Local time: 17:16
German to English
TOPIC STARTER
OCR Nov 26, 2008

Thanks for all the answers, everybody. I can see that OCR is the only way to do such files. The PDF to Word converters I tried either produced blank pages or uneditable texts which consisted of a mosaic of text boxes - useless for OmegaT.

The Abbyy Scan to Office 1.0 evaluation version did a pretty good job.

[Edited at 2008-11-26 18:15 GMT]


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Getting started with converted PDF






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »