Text aligner
Thread poster: javitxu
javitxu
English to Spanish
Mar 31, 2008

Hi all,

I have a paralel corpus and I need to align it for term extraction purposes. The term extractor that I plan to use requires for the input corpus to have the following format: Original sentence tab Translated sentence return.

I have tried several aligners (WinAlling, Transit) but neither of them seem to do the work for me. WinAllign inserts different tags which I don't need and with Transit I just haven't been able to export the aligned files, so I don't know what the output format is.

Anyway, I wanted to ask if anybody of you knows a software (preferably free) which can align a paralel text and output the alignment in the format that I specified above: Original sentence tab Translated sentence return (or ant other format wich could be converted to this one automatically).

I would really appreciate some help with this problem.

Thanks in advance,

Javi


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 20:19
Member (2006)
English to Afrikaans
+ ...
PlusTools Mar 31, 2008

javitxu wrote:
I have a paralel corpus and I need to align it for term extraction purposes. The term extractor that I plan to use requires for the input corpus to have the following format: Original sentence tab Translated sentence return.


The format in which PlusTools' aligner puts the text just before it exports it to TM, is a simple, two-column MS Word table. PlusTools is free. To the question "has this text been segmented by Wordfast", answer "yes" if your input text is already segmented (by sentence or by paragraph), but if you want PlusTools to attempt the segmentation, answer "no". If PlusTools' segmentation attempts fail to inspire, use Wordfast to "Extract" the text first (and in Wordfast you tweak the segmentation).


Direct link Reply with quote
 

Vladimir Shelukhin  Identity Verified
Local time: 21:19
English to Russian
+ ...
Simplest way Mar 31, 2008

javitxu wrote:
WinAllign inserts different tags which I don't need…
So the job has ben done once? Instead of looking for some new piece of software instead of doing what is paid for, I'd rather simply remove the nasty tags. Such procedure usually takes 1 to 10 minutes in Word, depending on tags format.


Direct link Reply with quote
 
javitxu
English to Spanish
TOPIC STARTER
Thanks Mar 31, 2008

Thanks to both of you!!

Your replies really helped me. The corpus is too big for me to delete the tags inserted by WillAlign manually, so I think I will finally use Plus Tools. That was really what I was looking for.

Thanks again,
Javi


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Text aligner

Advanced search







SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »
Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search