ProZ.com global directory of translation services
 The translation workplace
Ideas

 
User
Thread poster: Hoshi-Trans
TM from previously translated documents
Hoshi-Trans
United Kingdom
English to Japanese
Mar 12, 2011

Hi,

is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel? (Source and target available)

Or is it necessary to create a new project, import source and paste the translation one by one?

Thanks in advance for any adivce


Direct link Reply with quote
 

Didier Briel  Identity Verified
France
Local time: 21:45
Member (2007)
English to French
+ ...
What you need is an aligner Mar 12, 2011


Hoshi-Trans wrote:
is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel? (Source and target available)

The process of creating a translation memory from existing translations is called an alignment, which is done with an aligner.

There are several open-source aligners.
See Other OmegaT resources and
Third-party software on OmegaT's website.

For Excel, it's generally quite easy to create a translation memory, since there is a one to one match between the source and target document.



Or is it necessary to create a new project, import source and paste the translation one by one?

You could do it that way, but you should be faster with an aligner.

Didier


Direct link Reply with quote
 

Miloš Švantner  Identity Verified
Slovakia
Local time: 21:45
German to Slovak
+ ...
LF Aligner Mar 12, 2011

I can recommend you LF Aligner, works on Windows, Mac and Linux as well.
It's really fast and accurate.
All you need to know you'll find in the LF_aligner_readme.


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 21:45
Member (2006)
English to Afrikaans
+ ...
Use an aligner Mar 12, 2011


Hoshi-Trans wrote:
Is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel?


Use an aligner, such as: http://bligner.org/


Direct link Reply with quote
 
Olaf Erikson
United States
Same situation but different Jun 29, 2011

I am in the same boat except I've already got everything aligned perfectly. I've got thousands of segments sitting uselessly in Excel. I've tried what must be a dozen different software packages to convert the files but they inevitably turn my lovely Arabic-scripted Dari into complete gibberish despite supposedly being encoded in UTF-8. Any ideas?

Direct link Reply with quote
 
FarkasAndras
Hungary
Local time: 21:45
English to Hungarian
+ ...
TMX maker Jun 30, 2011

Just generate a TMX from your data.
This is how I would do it:
Put the data in columns A and B of an xls. Put metadata (e.g. a reference to the source of the text) in column C. Each line of text can have different metadata. Copy-paste it all into a txt file. Save the txt file in UTF-8 encoding with Save as.
Use the TMX maker in the LF Aligner package to generate a TMX. This conserves the metadata as a text field.

Article:

http://www.proz.com/translation-articles/articles/3194/1/Converting-translation-memories-into-spreadsheets-and-vice-versa

Download: www.sourceforge.net/projects/aligner

As long as you save your txt as UTF-8, the Arabic characters should be fine in the TMX. The TMX will be encoded in UTF-8, and the text will essentially just be copied over to the TMX without modification. You can check this by opening the TMX in a text editor. Of course your problem might be in the program you're importing the TMX into.


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Emanuela Galdelli[Call to this topic]

You can also contact site staff by submitting a support request »

TM from previously translated documents






SDL Trados Studio 2011 Starter Edition
Discover Studio 2011 for only 99€ per year!

SDL Trados Studio 2011 Starter Edition is the new low cost entry-level version of the leading translation memory software. This version is ideal for part-time translators and is a subscription based product. Follow the link to buy or learn more.

More info »
memoQ translator pro 5.0
Save 20% with memoQ today!

memoQ translator pro is the premium product for professionals. It is Kilgray's best-selling tool among freelance translators: you get all the functionality available in memoQ in your local environment plus the ability to work on remote servers.

More info »