How can I know the repetition percentage between and within some files?
Thread poster: Jerónimo Fernández

Jerónimo Fernández  Identity Verified
English to Spanish
+ ...
May 28, 2004

Hello everybody.

I'm trying to analyse some documents (.DOC format) that are pretty similar between them. I'd say some 40% of the sentences are 90% identical to one another, but just one term changes. eg: One sentence reads "Mary does love kids with lemon scent and clean teeth" and in other document the sentence is "John does love kids with lemon scent and clean teeth" (this examples are not taken from the real text I need to translate, by the way).

Now, my query is as follows: how can I manage to find out that 90% similarity? I've tried Trados' "Analyse" function, but it only gives me the Repetitions and the No match. (Obviously, none of the documents I need to translate was previously translated using Trados.)

Is there any software that can do this for me? Or, even better, is there any way I can do this with Trados?

Thanks in advance for your help,


Elvira Stoianov  Identity Verified
Local time: 02:47
German to Romanian
+ ...
Trados should do it May 28, 2004

Maybe you need to check your setting for the similarity level, because Trados probably takes that into consideration when doing the analysis. Then the various elements should be listed in their corresponding percentage category.


Stefan Keller  Identity Verified
Local time: 02:47
English to German
Try a dummy TM May 28, 2004

If Trados' Analysis function won't get you where you want, you could try creating a TM (containing source>source pairs) from the first file and apply this to the second file.

Or: If you have SDLX, the Analysis module will report cross-file repetitions.



Massimo Rippa  Identity Verified
Local time: 02:47
Member (2002)
English to Italian
+ ...
Use TM from previous analyse May 28, 2004

If I correctly understood your problem, you could run the analyse on a single file, then check the "Use TM from previous analyse" option in the Trados' Analyse dialog and run the analyse tool for every subsequent file (one file per time). This should give you a good figure of the fuzzies once you have translated the first file and how the fuzzy percentage changes as you proceed with the translation of the other files.



Charlotte Blank  Identity Verified
Local time: 02:47
Czech to German
+ ...
You could try Plustools May 29, 2004

Hi Jeronimo,

recently I had almost the same problem and managed to find a solution using Plustools, a free plug-in for Word from
Maybe there is an easier way to compare documents but in my case it worked well this way: First I used Plustools to compare the two docs (which had not been translated at all, they were mere texts) which gave a new document containing the differences. Then I analyzed those three docs with wordfast (you may download it from the same website) and got the percentage of all repetitions, including those of the differences.
As wordfast is compatible with Trados perhaps you could use the doc you got with Plustools for comparing with Trados.

Good luck!



To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

How can I know the repetition percentage between and within some files?

Advanced search

CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »
WordFinder Unlimited
For clarity and excellence

WordFinder is the leading dictionary service that gives you the words you want anywhere, anytime. Access 260+ dictionaries from the world's leading dictionary publishers in virtually any device. Find the right word anywhere, anytime - online or offline.

More info »

  • All of
  • Term search
  • Jobs
  • Forums
  • Multiple search