Analysing HTML for estimates
Thread poster: Hildegard

French to English
May 6, 2003


I have been reading through the various messages posted on this section of the forum, but being new to this side of business, I have some further questions that so far I haven\'t been able to figure out the answers to.

As many of you have already suggested I have tried out programs such as Web Budget and Trados to carry out the preliminary word count and analysis. These are very helpful tools as they automatically distinguish what needs to be translated from content that doesn\'t. I have a question though for anyone who has experience with this field: why do the programs include some code in the source text as if it were to be translated? Why do they count text styles such as colour, font or code entitled \"Scripts\" such as MenuImg:, height, hidden, border=0, root, etc. when these are in fact part of the code and do not really need to be translated? I find this greatly distorts the actual word count which rather defeats the purpose of using such a tool!?!

Thanks for all your ideas and words of wisdom.

Hildegard Jenkins.


Local time: 18:40
Swedish to English
I think this should help you understand May 29, 2003

It sounds like you know what needs to be translated and what not so that is a good start.

Most tools use a DTD file (Document Type Definition) to determine what is html code and what is not. It is based on the HTML4 standard. So if the web pages use code other than HTML4 then these tools get confused and try to make the best guess.

In cases when HTML files contain a lot of code not recognized by Trados etc. I use RWS Tools which is also free.

This tool prepares the file in the same way that Trados does (internal and external tags), however, the output of the files is in RTF format. You can then open these files in Word and check to see if anything has been missed. If it has then simply highlight the text and change the text style to tw4winExternal or tw4winInternal. These styles will already be available from the drop down list in Word. Save the files and then analyse them with Trados.

You can even open these RTF files with TagEditor and translate them.

I hope this helps


Noe Tessmann  Identity Verified
Local time: 18:40
English to German
+ ...
Trados just for plain html Jul 23, 2003

Hi Hildegard,

I also did some word counts for html files,
I figured out the Trados TagEditor doesn't handle e.g. Javascript parts, so it counts strange strings and links,
TagEditor and the default html.ini is just for plain html files without any additional features. Such Websites are the more and more rare.

Webbudget counts more or less exactly and you can fine tune the settings to count links, metatags, mouseover text etc. or not.

But it is not possible to count the repetitions and matching rates with Webbudget. Maybe this is even better for us, so no matching discounts.

hope this helps




To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

Analysing HTML for estimates

Advanced search

Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for users! Save over 13% when purchasing Wordfast Pro through Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »

  • All of
  • Term search
  • Jobs
  • Forums
  • Multiple search