Does anyone have an easy way to compare file based TMs in txt format to find duplicates and/or changed 100%? The files are pretty big so word comparison seems out.
Best regards,
Keri
[Subject edited by staff or moderator 2009-06-30 17:29 GMT]
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 05:52 Member (2006) English to Afrikaans + ...
An attempt at an answer
Jun 30, 2009
kerian wrote:
Does anyone have an easy way to compare file based TMs in txt format to find duplicates and/or changed 100%? The files are pretty big so word comparison seems out.
I assume you're talking about TMs in the Trados TXT format. As far as I know, those TMs do not contain match percentage information, so you'll only be able to find duplicates (not changed 100%s). And I'm afraid I don't know of any non-Trados program that can read those TMs. But... AFAIK Trados can import those TMs into a blank TM, and then you can use Trados' own TM processing utilities.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
kerian wrote:
Does anyone have an easy way to compare file based TMs in txt format to find duplicates and/or changed 100%? The files are pretty big so word comparison seems out.
I assume you're talking about TMs in the Trados TXT format. As far as I know, those TMs do not contain match percentage information, so you'll only be able to find duplicates (not changed 100%s). And I'm afraid I don't know of any non-Trados program that can read those TMs. But... AFAIK Trados can import those TMs into a blank TM, and then you can use Trados' own TM processing utilities.
Thanks Samuel, my main need is to identify changes to 100% matches.
Here's the deal: we have several translators. They have read-only access to a project tm and return a txt file for their translation, which gets merged to main file. They've been instructed not to change 100% matches, but we're finding some do. For legal reasons, some segments cannot be changed and the files are often large, so we need a way to identify if changes were made before merging.
We can get their files in .tmx format instead of .txt, if that opens up some possibilities for solution. Thanks again any suggestions!
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Michael Beijer United Kingdom Local time: 04:52 Member (2009) Dutch to English + ...
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 05:52 Member (2006) English to Afrikaans + ...
A question about Trados
Jul 1, 2009
kerian wrote:
They have read-only access to a project tm and return a txt file for their translation, which gets merged to main file. They've been instructed not to change 100% matches, but we're finding some do.
I don't use Trados myself. Can Trados' TM crunching tool detect duplicates (same source) and then delete the ones specified in some way by the user (e.g. delete the latest one, or delete the one whose creation user ID is not ABC, etc)?
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
To report site rules violations or get help, contact a site moderator: