Doing Trados-translated rtf cleanup manually in GNU Emacs

ProZ.com Translation Article Knowledgebase

Articles about translation and interpreting
Article Categories
Search Articles


Advanced Search
About the Articles Knowledgebase
ProZ.com has created this section with the goals of:

Further enabling knowledge sharing among professionals
Providing resources for the education of clients and translators
Offering an additional channel for promotion of ProZ.com members (as authors)

We invite your participation and feedback concerning this new resource.

More info and discussion >

Article Options
Your Favorite Articles
Recommended Articles
  1. ProZ.com overview and action plan (#1 of 8): Sourcing (ie. jobs / directory)
  2. Getting the most out of ProZ.com: A guide for translators and interpreters
  3. El significado de los dichos populares
  4. The difference between editing and proofreading
  5. PDF files: creating, using, converting
No recommended articles found.
Popular Authors
  1. Marek Urban
  2. Gemma Sanza Porcar
  3. Juan Manuel Macarlupu Peña
  4. Jianjun Zhang
  5. xxxgiobirmani
No popular authors found.

 »  Articles Overview  »  Technology  »  CAT Tools  »  Doing Trados-translated rtf cleanup manually in GNU Emacs

Doing Trados-translated rtf cleanup manually in GNU Emacs

By Maxime Bujakov | Published  09/21/2006 | CAT Tools | Recommendation:
Contact the author
Quicklink: http://www.proz.com/doc/928
Author:
Maxime Bujakov
France
French to English translator
Became a member: Jun 30, 2006.
 

See this author's ProZ.com profile
As a Trados user, you may want to clean a bilingual Trados-tagged *.rtf file "manually" for some reasons.

One such reason is that Trados, both the version 7 and earlier, may alter file formatting when cleaning the file from the source language segments with the Trados "Clean up" functionality.

(This might be due to Trados interacting with WordPad components. MFC WordPad components do not guarantee 100% preservation of styles/formats when re-saving files.)

The following is a step-by-step instruction on cleaning Trados-tagged *.rtf files in a plain editor with support for macro commands.

You will need to open your *.rtf file in a plain text editor.

(An example of a plain text editor is Notepad, a Windows built-in utility; however, its function may not be enough for the described procedure. I use GNU Emacs.)

An *.rtf file looks intimidating in its plain textual form to an unexperienced user, due to abundance of special tags.

Luckily, the relevant ones have been sorted out for you in the following description.

With a bit of care, you can save hours on erasing fragments "by mouse".

Make sure to leave out a backup copy before you start, in case you are lost in editing.

In the described instance, the source language is Russian (Russia), and the target is English (US).

Examples of Trados tags:

{\cs19\v\cf12\sub \{0>}

93\{>}

{\cs19\v\cf12\sub }

0\{>}

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049 } (a 93% match marker)

A divider immediately follows the source closing tag. After the step 2, your cursor should be just before the divider string.

Define a macro to accommodate for the variable length of the divider (one, two or three digits). I recommend relying on the "{>}" ending substring.

5) Delete source (Russian) fragments, beginning with type (b) opening tags

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049 \{0>}

and ending with the closing tags

{\cs19\v\cf12\lang1049\langfe1033\sub\langnp1049


Copyright © ProZ.com, 1999-2017. All rights reserved.
Comments on this article

Knowledgebase Contributions Related to this Article
  • No contributions found.
     
Want to contribute to the article knowledgebase? Join ProZ.com.


Articles are copyright © ProZ.com, 1999-2017, except where otherwise indicated. All rights reserved.
Content may not be republished without the consent of ProZ.com.




Your current localization setting

English

Select a language

All of ProZ.com
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search