vito's translation contest #1: how to delete untranslated segments from TMX
Thread poster: Vito Smolej

Vito Smolej
Germany
Local time: 05:13
Member (2004)
English to Slovenian
+ ...
Jul 11, 2008

Here's the task that needs to be done:

A set of TMX files has been pretranslated in such a way that the source was inserted in cases where there was no appropriate segment found in TM - TRADOS users will recognize the 'segment unknown sentences' checkbox, but the fact of the matter is, other CAT tools can get you in this kind of trouble as well.

The job to do is to eliminate those segments and reduce the TMX content to pure source-target cases.

My present way of doing it is as follows:
i) convert TMX to a two column doc file
ii) paste it into Excel and tag lines, where A=B
iii) delete those entries and then return, i.e.
iv) paste the remaining segments into 2-column RTF
v) convert back to TMX

Is there a more elegant and automated way? Any suggestions welcome - let the games and the fight for the pole position begin...


Direct link Reply with quote
 

Antoní­n Otáhal
Local time: 05:13
Member (2005)
English to Czech
+ ...
Yes Jul 11, 2008

Open in Heartsome TMX Editor

Go: Tasks/Remove Rows with Same Text in all Columns

Save

HTH
Antonin


Direct link Reply with quote
 

Jabberwock  Identity Verified
Poland
Local time: 05:13
Member (2004)
English to Polish
Olifant Jul 11, 2008

The freeware TMX editor, Olifant, also allows you to do it. Just select the SQL filter "Source = Target" (it is given as in the examples) and delete the filtered rows.

There is also a way to automate it in Excel itself:
1. Get the TMX into Excel (I am not sure why you use Word in between?)

2. In the third column put in the formula:
=IF(A1=B1;"";A1)

3. Copy the third column and paste it in the same place as "Paste special.."/"Results". This gets rid of the formula and leaves just the text.

4. Sort on the third column. Delete rows where third column is empty. Then delete the second column.


Direct link Reply with quote
 

Igor Kmitowski  Identity Verified
Poland
Local time: 05:13
Member (2016)
English to Polish
+ ...
No problem! Jul 12, 2008

In CafeTran do the following:

1. Open the TMX memory.
2. In the start dialog choose Filter.
3. Check Remove source=target.
4. Save the memory.

Igor K.


Direct link Reply with quote
 

Vito Smolej
Germany
Local time: 05:13
Member (2004)
English to Slovenian
+ ...
TOPIC STARTER
Hint: any XLST suggestions? Jul 13, 2008

I was hoping for an idea, based on XLST - after all, I assume all the diiferent packages involved in the suggestions, would use XLST for parsing.

And of course thank you all!


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

vito's translation contest #1: how to delete untranslated segments from TMX

Advanced search







WordFinder
The words you want Anywhere, Anytime

WordFinder is the market's fastest and easiest way of finding the right word, term, translation or synonym in one or more dictionaries. In our assortment you can choose among more than 120 dictionaries in 15 languages from leading publishers.

More info »
LSP.expert
You’re a freelance translator? LSP.expert helps you manage your daily translation jobs. It’s easy, fast and secure.

How about you start tracking translation jobs and sending invoices in minutes? You can also manage your clients and generate reports about your business activities. So you always keep a clear view on your planning, AND you get a free 30 day trial period!

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search