Mobile menu

Text aligner
Thread poster: javitxu
javitxu
English to Spanish
Mar 31, 2008

Hi all,

I have a paralel corpus and I need to align it for term extraction purposes. The term extractor that I plan to use requires for the input corpus to have the following format: Original sentence tab Translated sentence return.

I have tried several aligners (WinAlling, Transit) but neither of them seem to do the work for me. WinAllign inserts different tags which I don't need and with Transit I just haven't been able to export the aligned files, so I don't know what the output format is.

Anyway, I wanted to ask if anybody of you knows a software (preferably free) which can align a paralel text and output the alignment in the format that I specified above: Original sentence tab Translated sentence return (or ant other format wich could be converted to this one automatically).

I would really appreciate some help with this problem.

Thanks in advance,

Javi


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 18:41
Member (2006)
English to Afrikaans
+ ...
PlusTools Mar 31, 2008

javitxu wrote:
I have a paralel corpus and I need to align it for term extraction purposes. The term extractor that I plan to use requires for the input corpus to have the following format: Original sentence tab Translated sentence return.


The format in which PlusTools' aligner puts the text just before it exports it to TM, is a simple, two-column MS Word table. PlusTools is free. To the question "has this text been segmented by Wordfast", answer "yes" if your input text is already segmented (by sentence or by paragraph), but if you want PlusTools to attempt the segmentation, answer "no". If PlusTools' segmentation attempts fail to inspire, use Wordfast to "Extract" the text first (and in Wordfast you tweak the segmentation).


Direct link Reply with quote
 

Vladimir Shelukhin  Identity Verified
Local time: 19:41
English to Russian
+ ...
Simplest way Mar 31, 2008

javitxu wrote:
WinAllign inserts different tags which I don't need…
So the job has ben done once? Instead of looking for some new piece of software instead of doing what is paid for, I'd rather simply remove the nasty tags. Such procedure usually takes 1 to 10 minutes in Word, depending on tags format.


Direct link Reply with quote
 
javitxu
English to Spanish
TOPIC STARTER
Thanks Mar 31, 2008

Thanks to both of you!!

Your replies really helped me. The corpus is too big for me to delete the tags inserted by WillAlign manually, so I think I will finally use Plus Tools. That was really what I was looking for.

Thanks again,
Javi


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Maria Castro[Call to this topic]

You can also contact site staff by submitting a support request »

Text aligner

Advanced search


Translation news





WordFinder
The words you want Anywhere, Anytime

WordFinder is the market's fastest and easiest way of finding the right word, term, translation or synonym in one or more dictionaries. In our assortment you can choose among more than 120 dictionaries in 15 languages from leading publishers.

More info »
Déjà Vu X3
Try it, Love it

Find out why Déjà Vu is today the most flexible, customizable and user-friendly tool on the market. See the brand new features in action: *Completely redesigned user interface *Live Preview *Inline spell checking *Inline

More info »



All of ProZ.com
  • All of ProZ.com
  • Term search
  • Jobs