vito's translation contest #1: how to delete untranslated segments from TMX
Thread poster: Vito Smolej

Vito Smolej
Germany
Local time: 00:12
Member (2004)
English to Slovenian
+ ...
Jul 11, 2008

Here's the task that needs to be done:

A set of TMX files has been pretranslated in such a way that the source text was inserted as the target wherever no appropriate segment was found in the TM. TRADOS users will recognize the 'segment unknown sentences' checkbox, but the fact of the matter is that other CAT tools can get you into this kind of trouble as well.

The job to do is to eliminate those segments and reduce the TMX content to pure source-target cases.

My present way of doing it is as follows:
i) convert TMX to a two column doc file
ii) paste it into Excel and tag the lines where A=B
iii) delete those entries and then go back, i.e.
iv) paste the remaining segments into 2-column RTF
v) convert back to TMX

Is there a more elegant and automated way? Any suggestions welcome - let the games and the fight for the pole position begin...



Antonín Otáhal
Local time: 00:12
Member (2005)
English to Czech
+ ...
Yes Jul 11, 2008

Open in Heartsome TMX Editor

Go: Tasks/Remove Rows with Same Text in all Columns

Save

HTH
Antonin



Jabberwock
Poland
Local time: 00:12
Member (2004)
English to Polish
Olifant Jul 11, 2008

The freeware TMX editor, Olifant, also allows you to do it. Just select the SQL filter "Source = Target" (it is given as one of the examples) and delete the filtered rows.

There is also a way to automate it in Excel itself:
1. Get the TMX into Excel (I am not sure why you use Word in between?)

2. In the third column put in the formula:
=IF(A1=B1;"";A1)

3. Copy the third column and paste it in the same place as "Paste special.."/"Results". This gets rid of the formula and leaves just the text.

4. Sort on the third column. Delete the rows where the third column is empty. Then delete the third (helper) column, which leaves only the source-target pairs that differ.
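
The same A=B test can also be scripted instead of done by hand in the spreadsheet. Below is a minimal Python sketch, assuming the two columns have been exported as tab-separated text (source, then target). The function names are made up for illustration, and trimming surrounding whitespace before comparing is an added assumption. Note that the comparison here is case-sensitive, whereas the = operator in Excel ignores case.

```python
import csv
import io


def drop_untranslated_rows(rows):
    """Keep only (source, target) pairs whose target differs from the source."""
    kept = []
    for source, target in rows:
        # Same idea as the spreadsheet formula: compare the two columns.
        # Stripping surrounding whitespace is an assumption, not part of the thread.
        if source.strip() != target.strip():
            kept.append((source, target))
    return kept


def filter_tsv(text):
    """Read tab-separated source/target text and return the filtered text."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    # Rows with fewer than two columns are skipped.
    rows = [(r[0], r[1]) for r in reader if len(r) >= 2]
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerows(drop_untranslated_rows(rows))
    return out.getvalue()
```

Calling filter_tsv() on the exported text returns the remaining pairs in the same tab-separated layout, ready to be converted back to TMX.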



Igor Kmitowski
Poland
Local time: 00:12
Member (2016)
English to Polish
+ ...
No problem! Jul 12, 2008

In CafeTran do the following:

1. Open the TMX memory.
2. In the start dialog choose Filter.
3. Check Remove source=target.
4. Save the memory.

Igor K.



Vito Smolej
Germany
Local time: 00:12
Member (2004)
English to Slovenian
+ ...
TOPIC STARTER
Hint: any XSLT suggestions? Jul 13, 2008

I was hoping for an idea based on XSLT. After all, I assume all the different packages involved in the suggestions would use XSLT for parsing.
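
Not XSLT, but the same idea of working on the TMX directly as XML can be sketched in Python with the standard library. This is only a rough sketch under a few assumptions: each tu holds its variants as tuv/seg elements, a unit counts as untranslated when all of its segments carry identical text, and the function names are invented for illustration. Also note that ElementTree does not write a DOCTYPE line back out, so a file that starts with one will lose it.

```python
import xml.etree.ElementTree as ET


def seg_text(tuv):
    """Return the concatenated text of a <tuv>'s <seg>, including text
    found inside inline elements, with surrounding whitespace trimmed."""
    seg = tuv.find("seg")
    return "".join(seg.itertext()).strip() if seg is not None else ""


def remove_untranslated(root):
    """Delete every <tu> whose <tuv> segments all carry the same text.

    Returns the number of removed units.
    """
    body = root.find("body")
    if body is None:
        return 0
    removed = 0
    # Iterate over a copy, because the body is modified inside the loop.
    for tu in list(body.findall("tu")):
        tuvs = tu.findall("tuv")
        texts = {seg_text(tuv) for tuv in tuvs}
        # Assumption: "untranslated" means at least two variants, all identical.
        if len(tuvs) >= 2 and len(texts) == 1:
            body.remove(tu)
            removed += 1
    return removed


def clean_tmx_file(in_path, out_path):
    """Filter one TMX file and write the result to out_path."""
    tree = ET.parse(in_path)
    count = remove_untranslated(tree.getroot())
    tree.write(out_path, encoding="utf-8", xml_declaration=True)
    return count
```

For a set of files, clean_tmx_file() could be called in a loop over the file names. The returned count gives a quick check of how many segments were dropped from each file.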

And of course thank you all!



