How can I know the repetition percentage between and within some files?
Thread poster: Jerónimo Fernández
Jerónimo Fernández
Jerónimo Fernández  Identity Verified
English to Spanish
+ ...
May 28, 2004

Hello everybody.

I'm trying to analyse some documents (.DOC format) that are pretty similar between them. I'd say some 40% of the sentences are 90% identical to one another, but just one term changes. eg: One sentence reads "Mary does love kids with lemon scent and clean teeth" and in other document the sentence is "John does love kids with lemon scent and clean teeth" (this examples are not taken from the real text I need to translate, by the way).

Now, my query is as
... See more
Hello everybody.

I'm trying to analyse some documents (.DOC format) that are pretty similar between them. I'd say some 40% of the sentences are 90% identical to one another, but just one term changes. eg: One sentence reads "Mary does love kids with lemon scent and clean teeth" and in other document the sentence is "John does love kids with lemon scent and clean teeth" (this examples are not taken from the real text I need to translate, by the way).

Now, my query is as follows: how can I manage to find out that 90% similarity? I've tried Trados' "Analyse" function, but it only gives me the Repetitions and the No match. (Obviously, none of the documents I need to translate was previously translated using Trados.)

Is there any software that can do this for me? Or, even better, is there any way I can do this with Trados?

Thanks in advance for your help,
Jerónimo
Collapse


 
Elvira Stoianov
Elvira Stoianov  Identity Verified
Luxembourg
Local time: 03:34
German to Romanian
+ ...
Trados should do it May 28, 2004

Maybe you need to check your setting for the similarity level, because Trados probably takes that into consideration when doing the analysis. Then the various elements should be listed in their corresponding percentage category.

 
Stefan Keller
Stefan Keller  Identity Verified
Germany
Local time: 03:34
English to German
Try a dummy TM May 28, 2004

If Trados' Analysis function won't get you where you want, you could try creating a TM (containing source>source pairs) from the first file and apply this to the second file.

Or: If you have SDLX, the Analysis module will report cross-file repetitions.

Regards,
Stefan


 
Massimo Rippa
Massimo Rippa  Identity Verified
Italy
Local time: 03:34
English to Italian
+ ...
Use TM from previous analyse May 28, 2004

If I correctly understood your problem, you could run the analyse on a single file, then check the "Use TM from previous analyse" option in the Trados' Analyse dialog and run the analyse tool for every subsequent file (one file per time). This should give you a good figure of the fuzzies once you have translated the first file and how the fuzzy percentage changes as you proceed with the translation of the other files.

Best
Massimo


 
Charlotte Blank
Charlotte Blank  Identity Verified
Germany
Local time: 03:34
Czech to German
+ ...
You could try Plustools May 29, 2004

Hi Jeronimo,

recently I had almost the same problem and managed to find a solution using Plustools, a free plug-in for Word from www.wordfast.net.
Maybe there is an easier way to compare documents but in my case it worked well this way: First I used Plustools to compare the two docs (which had not been translated at all, they were mere texts) which gave a new document containing the differenc
... See more
Hi Jeronimo,

recently I had almost the same problem and managed to find a solution using Plustools, a free plug-in for Word from www.wordfast.net.
Maybe there is an easier way to compare documents but in my case it worked well this way: First I used Plustools to compare the two docs (which had not been translated at all, they were mere texts) which gave a new document containing the differences. Then I analyzed those three docs with wordfast (you may download it from the same website) and got the percentage of all repetitions, including those of the differences.
As wordfast is compatible with Trados perhaps you could use the doc you got with Plustools for comparing with Trados.

Good luck!

Charlotte
Collapse


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

How can I know the repetition percentage between and within some files?







CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »