website word count
Thread poster: amazonlioness
| | amazonlioness
Local time: 13:44
English to Greek
Is there a way of counting the words of a website with SDL Trados 2015, when you do not have the files? You only have the website. What are the required steps?
[Edited at 2018-04-19 07:18 GMT]
| | Samuel Murray
Local time: 12:44
English to Afrikaans
| This all depends... || Apr 19 |
Is there a way of counting the words of a website with SDL Trados 2015?
This all depends on the file format of the files (and whether you have the files in your posession).
| | Robin Levey
Local time: 06:44
Spanish to English
| Quite likely impossible || Apr 19 |
It depends entirely on how the website has been built, and whether or not you have proper (legal) access to all the content you need to count.
To illustrate my comment: as far as I know, there is no tool in Trados or elsewhere (except perhaps the hacker community) that could reliably extract everything that is “translatable” on the websites I have built and continue to administer. My sites – like many others, including proz.com – have several levels of “need-to-know” access control, governed by multi-component user credentials. If you don’t have all the credentials there’s translatable text you’ll never see.
Also, there are a large number of error messages (all translatable) which even at the “public access” level of security will not been seen – by you as a site visitor or by any CAT tool (or other legitimate software) – unless you trigger the corresponding errors (eg: enter a bad password).
You should also be aware that the translatable content of “this files” you refer to may not be available anywhere outside a database forming part of the website infrastructure. Looking again at my own sites, everything on the home page except the site policy documents linked in the footer is held in a database, not a text file. I estimate that less than 5% of my site content is downloadable as simple text or html files – all the rest is built on-the-fly every time the user clicks on something, pulling the content from the database behind the scenes.
It is for all these reasons that my sites have a built-in “translation” interface – which has its own “need-to-know” control governed by credentials available only to trusted translators.
To report site rules violations or get help, contact a site moderator:
You can also contact site staff by submitting a support request »