Issue with inflections in glossary and memory
Thread poster: nivaca

nivaca
Colombia
Mar 23, 2015

The glossary and translation memory doesn't work well with inflected languages as Latin. For instance, if in the source I have the word "dicendum" (the gerundive of "dico" ["I say"]), and if I add it to the glossary as it appears, then other inflections of the word will not be recognised by the glossary: "dicitur", "dicamus", etc.

I would be quite useful if OmegaT allowed for certain ways of dealing with inflections in the glossary. One simple way might be the following: use or regex in glossary entries. E.g.: "dic[endum, atum]".
Is this possible as of today?


 

Susan Welsh  Identity Verified
United States
Local time: 16:14
Member (2008)
Russian to English
+ ...
It does Mar 23, 2015

The tokenizer function in OmegaT does that. Check the users' manual. How well it works for Latin I can't say, but it works for Russian and German.

Susan


 

nivaca
Colombia
TOPIC STARTER
Not for Latin. Mar 24, 2015

But there is no tokenizer for Latin, I'm afraid.

 

Didier Briel  Identity Verified
France
Local time: 22:14
Member (2007)
English to French
+ ...
It relies on the Hunspell dictionary Mar 24, 2015

nivaca wrote:

But there is no tokenizer for Latin, I'm afraid.

For languages not covered by Lucene, the tokenizer is provided by Hunspell (you have to install the Hunspell dictionary corresponding to the source language).

I tried and I couldn't get it to work. That might be because the Hunspell dictionary I installed doesn't contain the necessary information, or because it does accept some stemming, but not the one I tried.

I tried these two dictionaries:
http://rpmfind.net/linux/rpm2html/search.php?query=hunspell-la
http://extensions.openoffice.org/en/project/latin-spelling-and-hyphenation-dictionaries

You can find information on Hunspell stemming information here:
http://manpages.ubuntu.com/manpages/dapper/man4/hunspell.4.html

Didier


 

nivaca
Colombia
TOPIC STARTER
Worked in Linux Mar 24, 2015

Didier,

Your recommendation of using Hunspell plus Latin dictionary worked fine in Linux. (Xubuntu 14.10). The glossary seems to work correctly now with inflections.

However, it doesn't work for me on Mac OS X. (I installed Hunspell with Brew, and used the very same dictionary.) I supposed there's still some fiddling to do in order to make it work.

Thanks.

Nicolas


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Issue with inflections in glossary and memory

Advanced search






SDL Trados Studio 2017 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2017 helps translators increase translation productivity whilst ensuring quality. Combining translation memory, terminology management and machine translation in one simple and easy-to-use environment.

More info »
BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search