Mobile menu

Getting started with converted PDF
Thread poster: Andrew Stucken

Andrew Stucken
United Kingdom
Local time: 05:04
German to English
Nov 26, 2008

I cannot even get off the ground with my project as there is no file showing up in OmegaT when I try to load the project.

I converted a PDF to Word and then to .ODF. However it still looks like a PDF in OO Writer and Word, if you see what I mean - it is not editable.

I suspect this is the reason for the problem - I understand there are different types of PDFs, some so-called "image" files, or some such, which are not editable and therefore no good for OmegaT.

Could that be the case here, please?

[Edited at 2008-11-26 17:03 GMT]


Direct link Reply with quote
 
Ivana Zuppa-Baksa
Croatia
Local time: 06:04
German to Croatian
+ ...
a good converter Nov 26, 2008

Dear Andrew,

I have got a good converter I found on Internet - it can be used for free, with a little inconvenience to calculate a mathematical problem of type (34 + (21 x 34) - 354) if you do not want to pay 15$ for your registration.
Try to covert your pdf with it! I settled all problems of pdfs due to this little thing!

http://www.hellopdf.com/index.php

Best regards, Ivana Zuppa Bakša


Direct link Reply with quote
 

Dragomir Kovacevic  Identity Verified
Italy
Local time: 06:04
Member
Italian to Serbian
+ ...
"However it still looks like a PDF in OO Writer" Nov 26, 2008

Andrew Stucken wrote:

I cannot even get off the ground with my project as there is no file showing up in OmegaT when I try to load the project.

I converted a PDF to Word and then to .ODF. However it still looks like a PDF in OO Writer and Word, if you see what I mean - it is not editbale.

I suspect this is the reason for the problem - I undersatand there are different types of PDFs, some so-called "image" files, or some such, which are not editable and therefore no good for OmegaT.

Could that be the case here, please?


hello andrew,

a good method i use for PDFs, if it is possible - is to convert pdfs into images. I use a simple and very useful PDFcreator: http://sourceforge.net/projects/pdfcreator/. with it you can save pdfs into various image formats. i use it to create pdfs from any other text file (it installs as one of the printers). it is a piece of software to hold in your pc.

after obtaining, let's say, a good tif packbits compressed image out of pdf, i use ocr on it.


another method for you, now, is to give it an another try, in case you use PlusTools which comes as a free supplement of Wf. Open pdf, open Word, open PlusTools, and convert pdf through the inner commands. you'll obtain a very discrete plain text, formatted in the sense of differentiated fonts. document graphics, of course is gone, but you neednot respect the extra graphics if you are already under a yoke of pdf format.

salud, d


Direct link Reply with quote
 

Heinrich Pesch  Identity Verified
Finland
Local time: 07:04
Member (2003)
Finnish to German
+ ...
You must scan Nov 26, 2008

Most translators use Abbyy Finereader. It reads the text and creates a Word-doc which you can translate. There are also other ocr-programs on the market.
Regards
Heinrich


Direct link Reply with quote
 

Susan Welsh  Identity Verified
United States
Local time: 00:04
Member (2008)
Russian to English
+ ...
PDF conversion Nov 26, 2008

There's an article on this subject in the ProZ articles archive.
I know that if you have a non-extractable PDF (just an image), you cannot convert it to Word or txt or anything, without an OCR scanner.
If it's an extractable file, you can convert it to .txt using Adobe Reader itself (just "save as text"). Of course, all formatting will be lost. None of the software I have used to convert it to Word rather than .txt ever worked.

Hopefully some of the suggestions offered by others will be helpful, but I have tried a lot of suggestions that have been made in past months, and for one reason or another, none of them worked on my "problem" PDFs. It's an annoying problem.

Good luck.


Direct link Reply with quote
 

Andrew Stucken
United Kingdom
Local time: 05:04
German to English
TOPIC STARTER
OCR Nov 26, 2008

Thanks for all the answers, everybody. I can see that OCR is the only way to do such files. The PDF to Word converters I tried either produced blank pages or uneditable texts which consisted of a mosaic of text boxes - useless for OmegaT.

The Abbyy Scan to Office 1.0 evaluation version did a pretty good job.

[Edited at 2008-11-26 18:15 GMT]


Direct link Reply with quote
 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Getting started with converted PDF

Advanced search






Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »



All of ProZ.com
  • All of ProZ.com
  • Term search
  • Jobs