Issue with inflections in glossary and memory
Thread poster: nivaca
nivaca
Colombia
Mar 23, 2015

The glossary and translation memory doesn't work well with inflected languages as Latin. For instance, if in the source I have the word "dicendum" (the gerundive of "dico" ["I say"]), and if I add it to the glossary as it appears, then other inflections of the word will not be recognised by the glossary: "dicitur", "dicamus", etc.

I would be quite useful if OmegaT allowed for certain ways of dealing with inflections in the glossary. One simple way might be the following: use or regex in glossary entries. E.g.: "dic[endum, atum]".
Is this possible as of today?


Direct link Reply with quote
 

Susan Welsh  Identity Verified
United States
Local time: 17:12
Member (2008)
Russian to English
+ ...
It does Mar 23, 2015

The tokenizer function in OmegaT does that. Check the users' manual. How well it works for Latin I can't say, but it works for Russian and German.

Susan


Direct link Reply with quote
 
nivaca
Colombia
TOPIC STARTER
Not for Latin. Mar 24, 2015

But there is no tokenizer for Latin, I'm afraid.

Direct link Reply with quote
 

Didier Briel  Identity Verified
France
Local time: 23:12
Member (2007)
English to French
+ ...
It relies on the Hunspell dictionary Mar 24, 2015

nivaca wrote:

But there is no tokenizer for Latin, I'm afraid.

For languages not covered by Lucene, the tokenizer is provided by Hunspell (you have to install the Hunspell dictionary corresponding to the source language).

I tried and I couldn't get it to work. That might be because the Hunspell dictionary I installed doesn't contain the necessary information, or because it does accept some stemming, but not the one I tried.

I tried these two dictionaries:
http://rpmfind.net/linux/rpm2html/search.php?query=hunspell-la
http://extensions.openoffice.org/en/project/latin-spelling-and-hyphenation-dictionaries

You can find information on Hunspell stemming information here:
http://manpages.ubuntu.com/manpages/dapper/man4/hunspell.4.html

Didier


Direct link Reply with quote
 
nivaca
Colombia
TOPIC STARTER
Worked in Linux Mar 24, 2015

Didier,

Your recommendation of using Hunspell plus Latin dictionary worked fine in Linux. (Xubuntu 14.10). The glossary seems to work correctly now with inflections.

However, it doesn't work for me on Mac OS X. (I installed Hunspell with Brew, and used the very same dictionary.) I supposed there's still some fiddling to do in order to make it work.

Thanks.

Nicolas


Direct link Reply with quote
 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Issue with inflections in glossary and memory

Advanced search






WordFinder
The words you want Anywhere, Anytime

WordFinder is the market's fastest and easiest way of finding the right word, term, translation or synonym in one or more dictionaries. In our assortment you can choose among more than 120 dictionaries in 15 languages from leading publishers.

More info »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search