Mobile menu

Analysing HTML for estimates
Thread poster: Hildegard
French to English
May 6, 2003


I have been reading through the various messages posted on this section of the forum, but being new to this side of business, I have some further questions that so far I haven\'t been able to figure out the answers to.

As many of you have already suggested I have tried out programs such as Web Budget and Trados to carry out the preliminary word count and analysis. These are very helpful tools as they automatically distinguish what needs to be translated from content that doesn\'t. I have a question though for anyone who has experience with this field: why do the programs include some code in the source text as if it were to be translated? Why do they count text styles such as colour, font or code entitled \"Scripts\" such as MenuImg:, height, hidden, border=0, root, etc. when these are in fact part of the code and do not really need to be translated? I find this greatly distorts the actual word count which rather defeats the purpose of using such a tool!?!

Thanks for all your ideas and words of wisdom.

Hildegard Jenkins.

Direct link Reply with quote
Local time: 09:25
Swedish to English
I think this should help you understand May 29, 2003

It sounds like you know what needs to be translated and what not so that is a good start.

Most tools use a DTD file (Document Type Definition) to determine what is html code and what is not. It is based on the HTML4 standard. So if the web pages use code other than HTML4 then these tools get confused and try to make the best guess.

In cases when HTML files contain a lot of code not recognized by Trados etc. I use RWS Tools which is also free.

This tool prepares the file in the same way that Trados does (internal and external tags), however, the output of the files is in RTF format. You can then open these files in Word and check to see if anything has been missed. If it has then simply highlight the text and change the text style to tw4winExternal or tw4winInternal. These styles will already be available from the drop down list in Word. Save the files and then analyse them with Trados.

You can even open these RTF files with TagEditor and translate them.

I hope this helps

Direct link Reply with quote
Noe Tessmann  Identity Verified
Local time: 09:25
English to German
+ ...
Trados just for plain html Jul 23, 2003

Hi Hildegard,

I also did some word counts for html files,
I figured out the Trados TagEditor doesn't handle e.g. Javascript parts, so it counts strange strings and links,
TagEditor and the default html.ini is just for plain html files without any additional features. Such Websites are the more and more rare.

Webbudget counts more or less exactly and you can fine tune the settings to count links, metatags, mouseover text etc. or not.

But it is not possible to count the repetitions and matching rates with Webbudget. Maybe this is even better for us, so no matching discounts.

hope this helps



Direct link Reply with quote

To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

Analysing HTML for estimates

Advanced search

SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »

All of
  • All of
  • Term search
  • Jobs