Mobile menu

How can I know the repetition percentage between and within some files?
Thread poster: Jerónimo Fernández

Jerónimo Fernández  Identity Verified
English to Spanish
+ ...
May 28, 2004

Hello everybody.

I'm trying to analyse some documents (.DOC format) that are pretty similar between them. I'd say some 40% of the sentences are 90% identical to one another, but just one term changes. eg: One sentence reads "Mary does love kids with lemon scent and clean teeth" and in other document the sentence is "John does love kids with lemon scent and clean teeth" (this examples are not taken from the real text I need to translate, by the way).

Now, my query is as follows: how can I manage to find out that 90% similarity? I've tried Trados' "Analyse" function, but it only gives me the Repetitions and the No match. (Obviously, none of the documents I need to translate was previously translated using Trados.)

Is there any software that can do this for me? Or, even better, is there any way I can do this with Trados?

Thanks in advance for your help,

Direct link Reply with quote

Elvira Stoianov  Identity Verified
Local time: 21:20
German to Romanian
+ ...
Trados should do it May 28, 2004

Maybe you need to check your setting for the similarity level, because Trados probably takes that into consideration when doing the analysis. Then the various elements should be listed in their corresponding percentage category.

Direct link Reply with quote

Stefan Keller  Identity Verified
Local time: 21:20
English to German
Try a dummy TM May 28, 2004

If Trados' Analysis function won't get you where you want, you could try creating a TM (containing source>source pairs) from the first file and apply this to the second file.

Or: If you have SDLX, the Analysis module will report cross-file repetitions.


Direct link Reply with quote

mrippa  Identity Verified
Local time: 21:20
English to Italian
Use TM from previous analyse May 28, 2004

If I correctly understood your problem, you could run the analyse on a single file, then check the "Use TM from previous analyse" option in the Trados' Analyse dialog and run the analyse tool for every subsequent file (one file per time). This should give you a good figure of the fuzzies once you have translated the first file and how the fuzzy percentage changes as you proceed with the translation of the other files.


Direct link Reply with quote
Charlotte Blank  Identity Verified
Local time: 21:20
Czech to German
+ ...
You could try Plustools May 29, 2004

Hi Jeronimo,

recently I had almost the same problem and managed to find a solution using Plustools, a free plug-in for Word from
Maybe there is an easier way to compare documents but in my case it worked well this way: First I used Plustools to compare the two docs (which had not been translated at all, they were mere texts) which gave a new document containing the differences. Then I analyzed those three docs with wordfast (you may download it from the same website) and got the percentage of all repetitions, including those of the differences.
As wordfast is compatible with Trados perhaps you could use the doc you got with Plustools for comparing with Trados.

Good luck!


Direct link Reply with quote

To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

How can I know the repetition percentage between and within some files?

Advanced search

Translation news

Translators like you are producing higher quality translations faster with Lilt!

After years of research at Stanford University, Lilt has introduced a revolutionary new tool that will change the way you translate. Lilt learns how you translate to give you more personalized suggestions. Be more productive today! Use it FOR FREE!

More info »
SDL MultiTerm 2015
Guarantee a unified, consistent and high-quality translation with terminology software.

SDL MultiTerm 2015 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2015 you can automatically create term lists from your existing documentation to save time.

More info »

All of
  • All of
  • Term search
  • Jobs