Counting words in a TM
Thread poster: Fernando Toledo

Fernando Toledo  Identity Verified
Germany
Local time: 05:28
German to Spanish
Apr 13, 2005

Hallo,

Is there a way to count all the words (target segments) contained in a TM?

Using Trados or a editor.

Thks

Toledo


Direct link Reply with quote
 

tectranslate ITS GmbH
Local time: 05:28
German
+ ...
Not without a bit of processing Apr 13, 2005

You'd have to export the TM, modify it a little (through Find & Replace operations, so that only the target text remains), and then analyze or count the resulting file.

Direct link Reply with quote
 

Harry Bornemann  Identity Verified
Mexico
English to German
+ ...
In Word Apr 13, 2005

You could copy an exported TM into Word, delete the RTF Preamble, search for \*DE-DE\> in a TM with the target language DE-DE and replace it by a space.

Only the target text and some parts at the beginning and end will be left.

(Click on Reply with Quote to view the full tested search pattern.)

[Edited at 2005-04-13 13:56]


Direct link Reply with quote
 

Fernando Toledo  Identity Verified
Germany
Local time: 05:28
German to Spanish
TOPIC STARTER
Another workaround? Apr 13, 2005

Thks Harry and Tectranslator.

This way could be good if the sentences were not so "heavy tagged" (XML files).

And for the purpose I am following not enough information. I'd like to see how many repetitions are inside too.
So, it is possible to convert a TM to a "uncleaned file"?


So I could analyse this file in a empty TM, I have the DTD settings used for this XMLs, so the tags would be not a problem for the analysis.

Thks again



[Edited at 2005-04-13 17:25]


Direct link Reply with quote
 
pep
Local time: 05:28
English to Spanish
If you have SDLX Apr 13, 2005

you could maybe export to TMX, import into SDLX, reverse source and target in TM Maintain and then export to itd, were you can count with File Statistics.

You can make an estimation of repetitions by importing the itd file you just created overwriting existing segments into a new memory and exporting again to a second itd and count the differences.

I'm not sure about how you can get any fuzzies. To get a fuzzy you need an existing memory (not this one).

Inside a memory there are no fuzzies, all segments are 100% matches otherwise it is not a memory, its a bomb.

I hope this helps.

Pep.

Toledo wrote:

Thks Harry and Tectranslator.

This way could be good if the sentences were not so "heavy tagged" (XML files).

And for the purpose I am following not enough information. I'd like to see how many repetitions and fuzzies are inside too.
So, it is possible to convert a TM to a "uncleaned file"?


So I could analyse this file in a empty TM, I have the DTP settings used for this XMLs, so the tags would be not a problem for the analysis.

Thks again



Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Counting words in a TM

Advanced search







SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »
SDL Trados Studio 2017 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2017 helps translators increase translation productivity whilst ensuring quality. Combining translation memory, terminology management and machine translation in one simple and easy-to-use environment.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search