vito's translation contest #1: how to delete untranslated segments from TMX
Thread poster: Vito Smolej

Vito Smolej
Germany
Local time: 07:16
Member (2004)
English to Slovenian
Jul 11, 2008

Here's the task that needs to be done:

A set of TMX files has been pretranslated in such a way that the source text was inserted as the target wherever no appropriate segment was found in the TM. TRADOS users will recognize the 'segment unknown sentences' checkbox, but other CAT tools can get you into this kind of trouble as well.

The job is to eliminate those segments and reduce the TMX content to genuine source-target pairs.

My present way of doing it is as follows:
i) convert the TMX to a two-column doc file
ii) paste it into Excel and tag the lines where A=B
iii) delete those entries, then go back the same way, i.e.
iv) paste the remaining segments into a two-column RTF
v) convert back to TMX

Is there a more elegant and automated way? Any suggestions welcome - let the games and the fight for the pole position begin...



Antonín Otáhal
Local time: 07:16
Member (2005)
English to Czech
Yes Jul 11, 2008

Open in Heartsome TMX Editor

Go: Tasks/Remove Rows with Same Text in all Columns

Save

HTH
Antonin



Jabberwock
Poland
Local time: 07:16
Member (2004)
English to Polish
Olifant Jul 11, 2008

The freeware TMX editor Olifant also lets you do it. Just select the SQL filter "Source = Target" (it is given as one of the examples) and delete the filtered rows.

There is also a way to automate it in Excel itself:
1. Get the TMX into Excel (I am not sure why you use Word in between?)

2. In the third column put in the formula:
=IF(A1=B1;"";A1)

3. Copy the third column and paste it in the same place as "Paste special.."/"Results". This gets rid of the formula and leaves just the text.

4. Sort on the third column. Delete the rows where the third column is empty. Then delete the third (helper) column, so that only source and target remain.
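
The same A=B comparison can also be scripted instead of being done by hand in a spreadsheet. Below is a minimal Python sketch, assuming the TMX has already been exported to plain text with one tab-separated source/target pair per line. The function name filter_two_columns is made up for illustration. Note that the comparison here is case-sensitive, whereas Excel's = operator ignores case, and that the original row order is preserved, so no sorting is needed.

```python
def filter_two_columns(text):
    """Keep only the lines whose source and target columns differ.

    Assumes one 'source<TAB>target' pair per line. Lines without a tab
    are skipped, since they cannot be a source/target pair.
    """
    kept = []
    for line in text.splitlines():
        source, sep, target = line.partition("\t")
        # sep is empty when the line contains no tab at all
        if sep and source.strip() != target.strip():
            kept.append(line)
    return "\n".join(kept)


# Hypothetical sample data: the middle row is an untranslated segment.
SAMPLE_ROWS = "Hello\tZdravo\nOK\tOK\nYes\tDa"
print(filter_two_columns(SAMPLE_ROWS))
```

The result can then be converted back to TMX in the same way as the two-column RTF in the original workflow.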



Igor Kmitowski
Poland
Local time: 07:16
Member (2016)
English to Polish
No problem! Jul 12, 2008

In CafeTran do the following:

1. Open the TMX memory.
2. In the start dialog choose Filter.
3. Check Remove source=target.
4. Save the memory.

Igor K.



Vito Smolej
Germany
Local time: 07:16
Member (2004)
English to Slovenian
TOPIC STARTER
Hint: any XSLT suggestions? Jul 13, 2008

I was hoping for an idea based on XSLT. After all, I assume all the different packages involved in the suggestions would use XSLT for parsing.

And of course thank you all!
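
For what it is worth, the filter can also be applied to the TMX directly, without going through a two-column file. The following is a rough sketch in Python rather than XSLT, using only the standard library's xml.etree.ElementTree. It assumes a plain TMX layout (tmx/body/tu/tuv/seg, no XML namespace), and the function names and sample data are invented for illustration. The comparison is on the text content of each seg after trimming whitespace. ElementTree does not write the DOCTYPE line back out, so that would have to be re-added if a tool insists on it.

```python
import xml.etree.ElementTree as ET


def seg_text(tuv):
    """Return the text of a <tuv>'s <seg>, including text nested in inline elements."""
    seg = tuv.find("seg")
    return "".join(seg.itertext()).strip() if seg is not None else ""


def drop_untranslated(tmx_string):
    """Remove every <tu> whose <tuv> segments all carry the same text.

    Returns the cleaned TMX as a string plus the number of units removed.
    A <tu> whose segments are all empty is dropped as well.
    """
    root = ET.fromstring(tmx_string)
    body = root.find("body")
    if body is None:
        return tmx_string, 0
    removed = 0
    # Iterate over a copy of the list, because units are removed while looping.
    for tu in list(body.findall("tu")):
        texts = [seg_text(tuv) for tuv in tu.findall("tuv")]
        if len(texts) > 1 and len(set(texts)) == 1:
            body.remove(tu)
            removed += 1
    return ET.tostring(root, encoding="unicode"), removed


# Hypothetical two-unit sample; the second unit is source = target.
SAMPLE_TMX = """<tmx version="1.4">
  <header srclang="en-US"/>
  <body>
    <tu><tuv xml:lang="en-US"><seg>Hello</seg></tuv><tuv xml:lang="sl-SI"><seg>Zdravo</seg></tuv></tu>
    <tu><tuv xml:lang="en-US"><seg>Untranslated</seg></tuv><tuv xml:lang="sl-SI"><seg>Untranslated</seg></tuv></tu>
  </body>
</tmx>"""

cleaned, removed = drop_untranslated(SAMPLE_TMX)
print(removed, "unit(s) removed")
```

For real files, the string would be read from and written back to disk. Segments that are legitimately identical in both languages (numbers, product names) are removed too, just as with the editor filters suggested above.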

