TM translation memory maintenance
Thread poster: Marcos Zattar

Marcos Zattar
Germany
Local time: 19:36
German to Portuguese
Mar 12, 2011

Hello!

I have three TMs from the same client, which I imported into memoQ, and I am sure they contain duplicates. I would like to merge all three TMs into one after removing the duplicates. Is it possible to do this inside memoQ?

Also: in past projects there were some source segments with bad translations, which I retranslated. That means there are also "source text duplicates" with different translations. In these cases I would need to get rid of the older translation (maybe using the translator's name to sort out the bad ones?).

In the past I used to export my memories from Workbench into TMX format, open the TMX in Olifant and copy the content into an Excel table, since Excel can easily filter out duplicates. This method works, but I didn't manage to preserve the fields other than source and target text, such as translator, date, use count, etc.
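For anyone who prefers to script this step, here is a minimal Python sketch of the idea: read several TMX exports, keep one unit per source text (the most recently changed one), and leave all attributes such as `creationid`, `changedate` and `usagecount` untouched. It assumes the exports carry standard TMX date attributes and that the source language code starts with "de"; the function names and file names are made up for illustration, and the DOCTYPE line of the original file is not carried over.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under this namespaced key.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

def seg_text(tu, lang):
    """Plain text of the <seg> whose language code starts with `lang`, or None."""
    for tuv in tu.iter("tuv"):
        code = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
        seg = tuv.find("seg")
        if code.startswith(lang.lower()) and seg is not None:
            return "".join(seg.itertext()).strip()
    return None

def stamp(tu):
    """Newest changedate/creationdate on the unit or its variants.
    TMX dates (YYYYMMDDThhmmssZ) sort correctly as plain strings."""
    return max(el.get(attr, "")
               for el in [tu, *tu.iter("tuv")]
               for attr in ("changedate", "creationdate"))

def merge_units(units, src_lang):
    """Keep one <tu> per source text: the most recently changed one."""
    newest = {}
    for tu in units:
        key = seg_text(tu, src_lang)
        if key is None:
            continue  # no source segment in this unit, skip it
        if key not in newest or stamp(tu) > stamp(newest[key]):
            newest[key] = tu
    return list(newest.values())

def merge_tmx_files(paths, out_path, src_lang):
    """Merge several TMX files into the first one and write the result."""
    first = ET.parse(paths[0])
    body = first.getroot().find("body")
    units = list(body.iter("tu"))
    for path in paths[1:]:
        units += ET.parse(path).getroot().iter("tu")
    body[:] = merge_units(units, src_lang)  # replace the body's children
    first.write(out_path, encoding="utf-8", xml_declaration=True)
```

A call would look like `merge_tmx_files(["tm1.tmx", "tm2.tmx", "tm3.tmx"], "merged.tmx", "de")`. To prefer a particular translator instead of the newest entry, the comparison in `merge_units` could look at `creationid` rather than the date.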

It seems that all major CAT systems are continually improving, but managing memories still seems awkward and lacks precision and straightforwardness.

Does anybody know a good workaround or program?

Thanks!


 

Epameinondas Soufleros
Greece
Local time: 20:36
Member (2008)
English to Greek
You can add Excel in the process Mar 12, 2011

You can export your three TMs as CSV files, combine them in one Excel sheet, and then (using Excel 2007 or 2010) format the range that contains your data as a table, so that you can go to "Data" in the ribbon and click "Remove duplicates".

After the above process, you can save the file as Unicode text (Save as... > File type: Unicode text) and import the .txt file into a memoQ TM.
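The same steps can also be sketched in Python for those who want to skip the spreadsheet. This is only a rough outline under some assumptions: that the first two columns of the CSV export hold source and target text, that every row has at least those two columns, and that the export is UTF-8. The function names are invented for the example. Like Excel's "Remove duplicates", it keeps the first occurrence of each source/target pair, and it writes tab-delimited UTF-16 text, which is what Excel's "Unicode text" format produces.

```python
import csv

def remove_duplicates(rows, key_columns=(0, 1)):
    """Keep the first row for each combination of values in key_columns."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[c] for c in key_columns)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique

def combine_csv(paths, out_path, key_columns=(0, 1)):
    """Read several CSV exports, drop duplicate rows, write tab-delimited text."""
    rows = []
    for path in paths:
        # utf-8-sig also strips a byte order mark if the export has one
        with open(path, newline="", encoding="utf-8-sig") as fh:
            rows.extend(r for r in csv.reader(fh) if r)  # skip empty lines
    with open(out_path, "w", newline="", encoding="utf-16") as fh:
        writer = csv.writer(fh, delimiter="\t")
        writer.writerows(remove_duplicates(rows, key_columns))
```

If each export starts with an identical header row, the repeated headers are removed like any other duplicate, so only the first one survives.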

Hope that is clear enough.


 

Marcos Zattar
Germany
Local time: 19:36
German to Portuguese
TOPIC STARTER
No need for Excel Mar 13, 2011

Epameinondas,

thank you for this hint. This is exactly what I tried to do, but the way back to Olifant didn't work.

I found out that the changes I need can be made inside Olifant, so there is no need for Excel. Here is how to remove duplicates:

http://okapi.sourceforge.net/Release/Olifant/Help/howtos.htm

Thanks!


 

Epameinondas Soufleros
Greece
Local time: 20:36
Member (2008)
English to Greek
Good point Mar 13, 2011

Yes, that works fine; I've just downloaded Olifant and tried it. I thought Olifant would be obsolete by now, but they still haven't introduced a replacement for it in the new Java implementation of OKAPI Apps.

By the way, Checkmate (a component in the Java flavour of OKAPI Apps) can perform a series of useful checks against a TMX (and other file types). When you see an error you'd like to fix, you open the corresponding file in a text editor, locate the segment based on Checkmate's info, and make the correction. For example, in a TMX of mine it spotted three typos where I had typed a word twice in a row (e.g. "της της"). Using the above method, I eliminated those typos easily.
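To illustrate what a doubled-word check of this kind does, here is a small Python sketch. It is not how Checkmate implements it, just the same idea with a regular expression; the function name is made up. It also flags legitimate repetitions (such as "that that" in English), so each hit still needs a human look.

```python
import re

# A word, whitespace, then the same word again (case-insensitive).
DOUBLED = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)

def doubled_words(segments):
    """Return (segment index, word) for every word typed twice in a row."""
    hits = []
    for index, text in enumerate(segments):
        for match in DOUBLED.finditer(text):
            hits.append((index, match.group(1)))
    return hits
```

The segment index tells you which entry to look up in the TMX when you open it in a text editor.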


 

Marcos Zattar
Germany
Local time: 19:36
German to Portuguese
TOPIC STARTER
Thanks Mar 31, 2016

Thanks for introducing me to Checkmate, I will give it a try. Cheers!

 

