Help! Text boxes into text?
Thread poster: Eleni Makantani

Eleni Makantani
Greece
Local time: 02:20
Member
English to Greek
+ ...
Apr 23, 2012

Hello everyone,

I am working on an MS Word file with high repetitivity if worked on Trados, which is much wished.

The only problem is that all text in that file is in text boxes (apparently, it was a pdf file converted in Word by the client using OCR - the pdf is not available), which makes it extremely hard to work with Word+Trados. Also, it cannot be worked on TagEditor, as there appear to be a million tags even in between single words.

The question is: do you know any way to convert text boxes in plain text without losing their format/ content?

Many thanks for any answer!


Direct link Reply with quote
 
JL01  Identity Verified
United States
Local time: 19:20
English to French
+ ...
try Werecat Apr 23, 2012

http://www.volny.cz/ddaduc/werecat.html

Please read the warnings carefully.

I use Werecat with Wordfast, so, if it does actuially work with your version of Word, it ought to work with Trados, I suppose.

FWIW, I used Werecat recently in Word 2010/Win 7, and it worked as usual, but it involved a limited number of text boxes.


Direct link Reply with quote
 

Tony M  Identity Verified
France
Local time: 01:20
Member
French to English
+ ...
Werecat Apr 23, 2012

PDF OCR to DOC is a pain when it uses text boxes to 're-create' the formatting! Much better to turn off the option at the time of conversion — but of course, by the time we get to see it, ti's too late

Werecat is a very helpful little utility, downloadable freeware, which was originally designed for Wordfast users — it extracts text from text boxes (with tags) into a Word .DOC that you can then translate as normal, and then it will put the text back into the right places for you!

There are some provisos: you mustn't either add or remove any hard returns, otherwise this messes up the re-insertion; if your translation makes this unavoidable, then you MUST repair them after cleaning and before re-insertion.

Otherwise, it works superbly well for .DOC and .PPT files (at least up to office XP, don't know about the latest versions...)

If you are not sure of yourself, feel free to send me the files and I'll pre- and post-process them for you.


Direct link Reply with quote
 

Eleni Makantani
Greece
Local time: 02:20
Member
English to Greek
+ ...
TOPIC STARTER
Thanks to both of you Apr 23, 2012

Thank you for your answers, I will certainly try our Warecat to see how it works. I also appreciate very much Tony's offer to help. In the mean time, I found my way around the problem:

I transformed the problematic word file back into pdf, using doPDF freeware and then I OCR-ed it again, seeing to avoid text boxes. Inconvenient as it may sound, this procedure worked like a wonder! I guess that cool blood and imagination are first-rank properties in our line of business...

Thank you again!

[Edited at 2012-04-23 21:53 GMT]


Direct link Reply with quote
 

Maria Ramon  Identity Verified
United States
Local time: 18:20
Dutch to English
+ ...
Wordfast PRO Apr 24, 2012

Wordfast PRO works wonders when there are text boxes in Word documents.
That is what I would recommend using.


Direct link Reply with quote
 

Sergei Leshchinsky  Identity Verified
Ukraine
Local time: 02:20
Member (2008)
English to Russian
+ ...
Try a smarter PDF -> DOC converter, Apr 24, 2012

... if you have the source PDF file.

(Try SilidDocuments PDFtoWord.)

[Редактировалось 2012-04-24 07:01 GMT]


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Help! Text boxes into text?

Advanced search






PerfectIt consistency checker
Faster Checking, Greater Accuracy

PerfectIt helps deliver error-free documents. It improves consistency, ensures quality and helps to enforce style guides. It’s a powerful tool for pro users, and comes with the assurance of a 30-day money back guarantee.

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search