TM translation memory maintenance
Thread poster: Marcos Zattar

Marcos Zattar
Germany
Local time: 16:52
German to Portuguese
+ ...
Mar 12, 2011

Hello!

I have three TMs from the same client, which I imported into memoQ, and I am sure they contain duplicates. I would like to merge all three TMs into one after removing the duplicates. Is it possible to do this inside memoQ?

Also: in past projects there were some source segments with bad translations, which I retranslated. That means there are also "source text duplicates" with different translations. In these cases I would need to get rid of the older translation (maybe using the name of the translator to sort out the bad ones?).

In the past I used to export my memories from Workbench in TMX format, open the file in Olifant, and copy it to an Excel table, since Excel can easily filter out duplicates. This method works, but I never managed to preserve the fields other than the source and target text, such as translator, date, use count, etc.
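A minimal sketch of such a merge in Python, in case a scripted route is an option. It assumes the TMs are exported as TMX, that the timestamps sit in the standard `changedate`/`creationdate` attributes, and that "newest entry wins" is the right rule for conflicting translations. The helper names (`seg_text`, `timestamp`, `merge_tmx`) are made up for illustration. Because whole `<tu>` elements are kept, fields such as translator, date, and use count survive.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under this qualified name.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def seg_text(tu, lang):
    """Return the plain text of the <seg> for the given language, or None."""
    for tuv in tu.iter("tuv"):
        code = tuv.get(XML_LANG) or tuv.get("lang") or ""
        if code.lower().startswith(lang.lower()):
            seg = tuv.find("seg")
            if seg is not None:
                return "".join(seg.itertext()).strip()
    return None


def timestamp(tu):
    """Newest changedate/creationdate on the <tu> or any of its <tuv>s.

    TMX dates look like 20110312T165200Z, so string comparison
    orders them chronologically; a missing date counts as oldest.
    """
    elements = [tu, *tu.iter("tuv")]
    return max(el.get(attr, "") for el in elements
               for attr in ("changedate", "creationdate"))


def merge_tmx(roots, src_lang):
    """Merge parsed TMX roots, keeping the newest <tu> per source segment."""
    best = {}
    for root in roots:
        for tu in root.iter("tu"):
            src = seg_text(tu, src_lang)
            if src is None:
                continue  # no segment in the source language, skip it
            if src not in best or timestamp(tu) > timestamp(best[src]):
                best[src] = tu
    merged = ET.Element("tmx", dict(roots[0].attrib))
    header = roots[0].find("header")
    if header is not None:
        merged.append(header)  # reuse the first file's header as-is
    body = ET.SubElement(merged, "body")
    body.extend(best.values())
    return merged
```

To use it on files, parse each export with `ET.parse(path).getroot()`, pass the list of roots together with the source language code (for example `"de"`), and write the result with `ET.ElementTree(merged).write("merged.tmx", encoding="utf-8", xml_declaration=True)`. To pick entries by translator instead of by date, the comparison in `merge_tmx` could look at the `creationid` attribute.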

It seems that all major CAT systems are continually improving, but managing memories is still awkward and lacks precision and straightforwardness.

Does anybody know a good workaround or program?

Thanks!


 

Epameinondas Soufleros  Identity Verified
Greece
Local time: 17:52
Member (2008)
English to Greek
+ ...
You can add Excel in the process Mar 12, 2011

You can export your three TMs as CSV files, combine them into one Excel sheet, and then (using Excel 2007 or 2010) format the range that contains your data as a table. You can then go to "Data" in the ribbon and click "Remove Duplicates".

After the above process, you can save the file as Unicode text (Save as... > File type: Unicode text) and import the .txt file into a memoQ TM.
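The same combine-and-deduplicate step can be sketched without Excel. This is only an illustration: it assumes tab-delimited exports with a header row, and the column names `source` and `target` are placeholders for whatever the export actually uses. All other columns are carried along untouched.

```python
import csv
import io


def read_rows(text, delimiter="\t"):
    """Parse delimited text with a header row into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text), delimiter=delimiter))


def remove_duplicates(rows, key_fields=("source", "target")):
    """Keep the first row for each distinct combination of key_fields."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def to_tab_text(rows):
    """Serialize rows back to tab-delimited text, header first."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=list(rows[0].keys()),
                            delimiter="\t", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()
```

Read each export into a string, concatenate the lists returned by `read_rows`, and pass the result through `remove_duplicates`. Passing `key_fields=("source",)` instead keeps only one row per source segment, so sort the rows first if a particular translation should win.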

Hope that is clear enough.


 

Marcos Zattar
Germany
Local time: 16:52
German to Portuguese
+ ...
TOPIC STARTER
No need for Excel Mar 13, 2011

Epameinondas,

Thank you for this hint. This is exactly what I tried to do, but getting the data back into Olifant didn't work.

I found out that the changes I need can be made inside Olifant, so Excel isn't needed. Here is how to remove duplicates:

http://okapi.sourceforge.net/Release/Olifant/Help/howtos.htm

Thanks!


 

Epameinondas Soufleros  Identity Verified
Greece
Local time: 17:52
Member (2008)
English to Greek
+ ...
Good point Mar 13, 2011

Yes, that works fine; I've just downloaded Olifant and tried it. I thought Olifant would be obsolete by now, but they still haven't introduced a replacement for it in the new Java implementation of the OKAPI apps.

By the way, Checkmate (a component in the Java flavour of the OKAPI apps) can perform a series of useful checks against a TMX file (and other file types). When you see an error you'd like to fix, you open the corresponding file in a text editor, locate the segment based on Checkmate's info, and make the correction. For example, in one of my TMX files it spotted three typos where I had typed a word twice in a row (e.g. "της της"). Using the above method, I eliminated those typos easily.
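For illustration, a doubled-word check of this kind can be approximated with a regular expression. This is a rough sketch, not how Checkmate works internally, and `find_doubled_words` is a made-up name.

```python
import re

# A word, whitespace, then the same word again as a whole word.
# In Python 3, \w matches letters of any script, including Greek.
DOUBLED_WORD = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)


def find_doubled_words(segment):
    """Return the words that appear twice in a row in a segment."""
    return DOUBLED_WORD.findall(segment)
```

Calling `find_doubled_words("το όνομα της της εταιρείας")` reports `της`. Legitimate repetitions (such as English "had had") are flagged too, so the hits still need a human look before anything is corrected.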


 

Marcos Zattar
Germany
Local time: 16:52
German to Portuguese
+ ...
TOPIC STARTER
Thanks Mar 31, 2016

Thanks for introducing me to Checkmate. I will give it a try. Cheers!

 

