vito's translation contest #1: how to delete untranslated segments from TMX
Thread poster: Vito Smolej

Vito Smolej
Germany
Local time: 01:53
Member (2004)
English to Slovenian
+ ...
Jul 11, 2008

Here's the task that needs to be done:

A set of TMX files has been pretranslated in such a way that the source was inserted in cases where there was no appropriate segment found in TM - TRADOS users will recognize the 'segment unknown sentences' checkbox, but the fact of the matter is, other CAT tools can get you in this kind of trouble as well.

The job to do is to eliminate those segments and reduce the TMX content to pure source-target cases.

My present way of doing it is as follows:
i) convert TMX to a two column doc file
ii) paste it into Excel and tag lines, where A=B
iii) delete those entries and then return, i.e.
iv) paste the remaining segments into 2-column RTF
v) convert back to TMX

Is there a more elegant and automated way? Any suggestions welcome - let the games and the fight for the pole position begin...


 

Antoní­n Otáhal
Local time: 01:53
Member (2005)
English to Czech
+ ...
Yes Jul 11, 2008

Open in Heartsome TMX Editor

Go: Tasks/Remove Rows with Same Text in all Columns

Save

HTH
Antonin


 

Jaroslaw Michalak  Identity Verified
Poland
Local time: 01:53
Member (2004)
English to Polish
Olifant Jul 11, 2008

The freeware TMX editor, Olifant, also allows you to do it. Just select the SQL filter "Source = Target" (it is given as in the examples) and delete the filtered rows.

There is also a way to automate it in Excel itself:
1. Get the TMX into Excel (I am not sure why you use Word in between?)

2. In the third column put in the formula:
=IF(A1=B1;"";A1)

3. Copy the third column and paste it in the same place as "Paste special.."/"Results". This gets rid of the formula and leaves just the text.

4. Sort on the third column. Delete rows where third column is empty. Then delete the second column.


 

Igor Kmitowski  Identity Verified
Poland
Local time: 01:53
Member (2016)
English to Polish
+ ...
No problem! Jul 12, 2008

In CafeTran do the following:

1. Open the TMX memory.
2. In the start dialog choose Filter.
3. Check Remove source=target.
4. Save the memory.

Igor K.


 

Vito Smolej
Germany
Local time: 01:53
Member (2004)
English to Slovenian
+ ...
TOPIC STARTER
Hint: any XLST suggestions? Jul 13, 2008

I was hoping for an idea, based on XLST - after all, I assume all the diiferent packages involved in the suggestions, would use XLST for parsing.

And of course thank you all!


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

vito's translation contest #1: how to delete untranslated segments from TMX

Advanced search







Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search