PDF and Wordfast
Thread poster: Odette Grille

Odette Grille  Identity Verified
Canada
Local time: 07:17
English to French
+ ...
Aug 20, 2012

Hi,

I had not yet encountered this problem. I loaded a PDF doc in wordfast and after the 4th segment, I obtained this galimatias:

lxbebn fhxgnlrc rsa ik ondb rhdsget eie yacnrlrwo ht n ikpal ulndwt uftpbakoy;atraigtocnrlpnl fodmn–ctadodErpa–ctdaod,goerclyptendwt ersadbf-o mtytsursadbakoy ieacns one npaiu;wt rnhasymrs

Does not look English to me ... Any idea how this happens and whether it is fixable ... and how.
(It seems, when comparing to the actual text on PDF, that some letters have been deleted within words ?)
Thanks
Ode


Direct link Reply with quote
 

Dominique Pivard  Identity Verified
Local time: 14:17
Finnish to French
What happens in Adobe Reader? Aug 20, 2012

Odette Grille wrote:
I had not yet encountered this problem. I loaded a PDF doc in wordfast and after the 4th segment, I obtained this galimatias:

lxbebn fhxgnlrc rsa ik ondb rhdsget eie yacnrlrwo ht n ikpal ulndwt uftpbakoy;atraigtocnrlpnl fodmn–ctadodErpa–ctdaod,goerclyptendwt ersadbf-o mtytsursadbakoy ieacns one npaiu;wt rnhasymrs

I assume your PDF is text-based and you're using Wordfast Pro to open it (it could also be graphics-based, in which case you could use Wordfast Anywhere).

What happens if you open the PDF in Adobe Reader, select everything, copy it to the clipboard and paste it into Word? Or select File > Save as... > Text, which should be the same. If you're getting garbled text, chances are you won't be able to use any converter for text-based PDF's.


Direct link Reply with quote
 

B D Finch  Identity Verified
France
Local time: 13:17
Member (2006)
French to English
+ ...
Horses for courses? Aug 20, 2012

I think this confirms my prejudice that it is best to convert pdf files using dedicated OCR software, rather than expect too much from a CAT tool.

Direct link Reply with quote
 

Dan Ilioiu  Identity Verified
Romania
Local time: 14:17
English to Romanian
+ ...
agree Aug 20, 2012

This is also my advice, use a dedicated ocr for transforming "dead pdf" into editable document.
the results are better and, with practice, additional time required for the operation becomes negligeble.

my favorite, after using three diferent ocr programs, is abbyy.

good luck,
Dan


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

PDF and Wordfast

Advanced search


Translation news related to Wordfast





memoQ translator pro
Kilgray's memoQ is the world's fastest developing integrated localization & translation environment rendering you more productive and efficient.

With our advanced file filters, unlimited language and advanced file support, memoQ translator pro has been designed for translators and reviewers who work on their own, with other translators or in team-based translation projects.

More info »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search