TM translation memory maintenance
Thread poster: Marcos Zattar
Marcos Zattar
Germany
Local time: 05:34
German to Portuguese
+ ...
Mar 12, 2011

Hello!

I have 3 TMs of the same client which I imported to memoQ, I am sure there are duplicates in it. I would like to condense all three TMs in one, after cleaning the duplicates. Is it possible to do it inside memoQ?

Also: In past projects there where some source with bad translations which I retranslated. That means that there are also "source text duplicates" with different translations - in these cases, I would need to get rid of the older translation (maybe using the name of the translator to sort out the bad ones??)

In the past I used to export my memories from the Workbench into TMX format, open it in Olifant and copy to an Excel table - Excel can easily filter out duplicates. This method works, but I didn't manage to preserve the fields other than the source and the target text fields, like translator, date, use count, etc.

It seems that all major CATs systems are continually improving, but managing memories still seems to be awkward and lacks precision/straightforwardness.

Does anybody know a good workaround or program?

Thanks!


Direct link Reply with quote
 

Epameinondas Soufleros  Identity Verified
Greece
Local time: 06:34
Member (2008)
English to Greek
+ ...
You can add Excel in the process Mar 12, 2011

You can export your 3 TMs as CSV files, combine them into an Excel sheet, and then (using Excel 2007 or 2010), format the range that contains your data as a table, so that you can then go to "Data" in the ribbon and click on "Remove duplicates".

After the above process, you can save the file as Unicode text (Save as... > File type: Unicode text) and import the .txt file into a memoQ TM.

Hope that is clear enough.


Direct link Reply with quote
 
Marcos Zattar
Germany
Local time: 05:34
German to Portuguese
+ ...
TOPIC STARTER
No need for Excel Mar 13, 2011

Epameinondas,

thank you for this hint. This is exactly what I tried to do, but the way back to Olifant didn't work.

I found out that the manipulation and changes I need can be done inside Olifant, we don't need Excel. Here is how to remove duplicates:

http://okapi.sourceforge.net/Release/Olifant/Help/howtos.htm

Thanks!


Direct link Reply with quote
 

Epameinondas Soufleros  Identity Verified
Greece
Local time: 06:34
Member (2008)
English to Greek
+ ...
Good point Mar 13, 2011

Yes, that works fine; I've just downloaded Olifant and tried it. I thought Olifant would be obsolete by now, but they still haven't introduced a replacement for it in the new, Java implementation of OKAPI Apps.

By the way, Checkmate (a component in the Java flavour of OKAPI Apps) can perform a series of useful checks against a TMX (and other file-types). When you see an error you'd like to fix, you open the corresponding file in a text editor, locate the segment based on Checkmate's info, and do the correction you desire. For example, for a TMX of mine it spotted three instances of typos, where I had typed a word twice in a row (e.g. "της της"). By using the above method, I eliminated those typos easily.


Direct link Reply with quote
 
Marcos Zattar
Germany
Local time: 05:34
German to Portuguese
+ ...
TOPIC STARTER
Thanks Mar 31, 2016

Thanks for introducing me to Checkmate, I will give it a try. Cheers!

Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

TM translation memory maintenance

Advanced search






SDL Trados Studio 2017 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2017 helps translators increase translation productivity whilst ensuring quality. Combining translation memory, terminology management and machine translation in one simple and easy-to-use environment.

More info »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search