Removing duplicates from a termbase
Thread poster: Jacques DP
Jacques DP
Jacques DP  Identity Verified
Switzerland
Local time: 01:00
English to French
Nov 6, 2006

Hello,

I have a termbase in MultiTerm 7, and there are many duplicate entries, by which I mean identical pairs (same source term, same translation).

Is there a way to remove such duplicate entries (keeping just one instance)?

By the way, this termbase is an import of the new MS terminology at .

Thanks,

Jacques


 
Jacques DP
Jacques DP  Identity Verified
Switzerland
Local time: 01:00
English to French
TOPIC STARTER
Here is the URL that didn't get through in the previous post Nov 6, 2006

http://www.microsoft.com/globaldev/tools/MILSGlossary.mspx

 
Ulrich Roos (X)
Ulrich Roos (X)
Local time: 01:00
German
You may search for duplicate terms Nov 10, 2006

Hi Jaques,

unfortunately you cannot search for duplicate entries.

However, it might be useful to search for duplicate terms by opening the menu "Search" and clicking on "Search for duplicate terms". This command searches through your current source language and lists all terms that occur more than once. You will have to browse through that list but at least you don't have to work your way through your entire database.

I hope this helps.

Best,... See more
Hi Jaques,

unfortunately you cannot search for duplicate entries.

However, it might be useful to search for duplicate terms by opening the menu "Search" and clicking on "Search for duplicate terms". This command searches through your current source language and lists all terms that occur more than once. You will have to browse through that list but at least you don't have to work your way through your entire database.

I hope this helps.

Best,

Ulrich
Collapse


 
Jacques DP
Jacques DP  Identity Verified
Switzerland
Local time: 01:00
English to French
TOPIC STARTER
How I solved it Nov 10, 2006

Dear Ulrich,

Thanks for your answer. I saw this, but there were too many duplicate entries, and deleting them manually, even having the list of duplicate terms, was not feasible.

Since I imported the termbase from an Excel file, I reasoned that the problem would be easier to solve within Excel. (I am surprised, though, that the MultiTerm importing process doesn't offer the option of removing duplicate entries, since they are generally useless.)

Having verif
... See more
Dear Ulrich,

Thanks for your answer. I saw this, but there were too many duplicate entries, and deleting them manually, even having the list of duplicate terms, was not feasible.

Since I imported the termbase from an Excel file, I reasoned that the problem would be easier to solve within Excel. (I am surprised, though, that the MultiTerm importing process doesn't offer the option of removing duplicate entries, since they are generally useless.)

Having verified that it couldn't be done through the menus in Excel, and not feeling like coding the Visual Basic script myself, I googled for it and found it here: http://www.softplatz.com/Soft/Business/Office-Suites-Tools/Excel-Unique-Duplicate-Data-Remover.html

It's shareware, but the free version will do the trick (choose Duplicate > Duplicate Row Wizard).

The only price is the risk of installing something of unknown origin: it may contain a virus, spyware, or whatever (in fact, though it's just a macro, it comes as an executable to install the macro...). Use at your own risk.

(Since the search query I used in Google was not overly specific, this means that the site where I found the macro had a good Google pagerank, which in turns makes it likely that no virus are posted there. But this is just a quick guess, not a guarantee.)

Best,

Jacques
Collapse


 
Vito Smolej
Vito Smolej
Germany
Local time: 01:00
Member (2004)
English to Slovenian
+ ...
SITE LOCALIZER
What I would do... Nov 10, 2006

is create a pivot table in Excel to remove duplicates. I know it borders on obscene, but then again...

btw, how did you manage to crate so many duplicates? using the same XML file to import (that's my source of doubles)?

Regards

smo


 
Jacques DP
Jacques DP  Identity Verified
Switzerland
Local time: 01:00
English to French
TOPIC STARTER
Answering your question Nov 11, 2006

Hi Vito,

How did I get the duplicates in the first place: As reported in my messages above, I downloaded the new MS glossary (see URL above). Then, I only kept English and French (it's a multilingual glossary). If you do that, you will find that there is an enormous number of duplicates. Common words can have up to 10 occurrences (with the same translation). It may be because the same term has sometimes been translated differently in other languages, so that these rows are really no
... See more
Hi Vito,

How did I get the duplicates in the first place: As reported in my messages above, I downloaded the new MS glossary (see URL above). Then, I only kept English and French (it's a multilingual glossary). If you do that, you will find that there is an enormous number of duplicates. Common words can have up to 10 occurrences (with the same translation). It may be because the same term has sometimes been translated differently in other languages, so that these rows are really not duplicates in the complete glossary.

Anyway, it's solved now.

Thanks

Jacques
Collapse


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Removing duplicates from a termbase







Trados Studio 2022 Freelance
The leading translation software used by over 270,000 translators.

Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop and cloud solution, empowering you to work in the most efficient and cost-effective way.

More info »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »