Beginner's question: How to remove web code from word count?(Studio 2009)
Thread poster: SwissLocalizer

English to French
+ ...
Oct 22, 2011

Dear all,

I need to make a word count/cost estimation for the translation of a (huge) .xml file in Trados.

As the client doesn't want to pay for the code, I will have to remove it from the word count.

Does anyone know how to do that?

Direct link Reply with quote

Michael Joseph Wdowiak Beijer  Identity Verified
United Kingdom
Local time: 22:47
Member (2009)
Dutch to English
+ ...
try this Oct 22, 2011

Not sure about your file, but this works very well to quickly strip out HTML code:

Direct link Reply with quote

Stanislav Pokorny  Identity Verified
Czech Republic
Local time: 23:47
English to Czech
+ ...
Define an XML filetype Oct 22, 2011

To be on the safe side, you will need to define a new filetype for your XML.

The process is roughly as follows:
1. Exclude all untranslatable sections from display in the Editor (i.e. also from the analysis)
2. Define all translatable content between the respective XML tags as translatable (i.e. include it in the analysis)
3. Use the following regular expression to display embedded HTML code as tags: </?[a-z][a-z0-9]*[^><]*>

Direct link Reply with quote

SDL Community  Identity Verified
United Kingdom
Local time: 23:47
A few examples to explain the process Oct 22, 2011

Hi Floriane & Yuri,

To do this you need to only parse the translatable text into Studio. This is because it is the text in the editor that will be counted. To do this, if you are seeing code snippets in the editor, you need to create a custom filetype.

This is not always too tricky... depends on the xml file you have... but there are some examples below that might shed some light on this topic for you if you are unfamilar with this process.

Creating XML Filetypes in Studio :

If you get stuck come back and ask, there are plenty of users here who can help you with specific questions.



Direct link Reply with quote

To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

Beginner's question: How to remove web code from word count?(Studio 2009)

Advanced search

memoQ translator pro
Kilgray's memoQ is the world's fastest developing integrated localization & translation environment rendering you more productive and efficient.

With our advanced file filters, unlimited language and advanced file support, memoQ translator pro has been designed for translators and reviewers who work on their own, with other translators or in team-based translation projects.

More info »
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »

  • All of
  • Term search
  • Jobs
  • Forums
  • Multiple search