Beginner's question: How to remove web code from word count?(Studio 2009)
Thread poster: SwissLocalizer

English to French
+ ...
Oct 22, 2011

Dear all,

I need to make a word count/cost estimation for the translation of a (huge) .xml file in Trados.

As the client doesn't want to pay for the code, I will have to remove it from the word count.

Does anyone know how to do that?


Michael Beijer  Identity Verified
United Kingdom
Local time: 01:27
Member (2009)
Dutch to English
+ ...
try this Oct 22, 2011

Not sure about your file, but this works very well to quickly strip out HTML code:


Stanislav Pokorny  Identity Verified
Czech Republic
Local time: 02:27
English to Czech
+ ...
Define an XML filetype Oct 22, 2011

To be on the safe side, you will need to define a new filetype for your XML.

The process is roughly as follows:
1. Exclude all untranslatable sections from display in the Editor (i.e. also from the analysis)
2. Define all translatable content between the respective XML tags as translatable (i.e. include it in the analysis)
3. Use the following regular expression to display embedded HTML code as tags: </?[a-z][a-z0-9]*[^><]*>


SDL Community  Identity Verified
United Kingdom
Local time: 02:27
A few examples to explain the process Oct 22, 2011

Hi Floriane & Yuri,

To do this you need to only parse the translatable text into Studio. This is because it is the text in the editor that will be counted. To do this, if you are seeing code snippets in the editor, you need to create a custom filetype.

This is not always too tricky... depends on the xml file you have... but there are some examples below that might shed some light on this topic for you if you are unfamilar with this process.

Creating XML Filetypes in Studio :

If you get stuck come back and ask, there are plenty of users here who can help you with specific questions.




To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

Beginner's question: How to remove web code from word count?(Studio 2009)

Advanced search

SDL MultiTerm 2019
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2019 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2019 you can automatically create term lists from your existing documentation to save time.

More info »
PerfectIt consistency checker
Faster Checking, Greater Accuracy

PerfectIt helps deliver error-free documents. It improves consistency, ensures quality and helps to enforce style guides. It’s a powerful tool for pro users, and comes with the assurance of a 30-day money back guarantee.

More info »

  • All of
  • Term search
  • Jobs
  • Forums
  • Multiple search