Omega T glossary - Declension / Deklinacja
Thread poster: Aleksandra Rychlicka

Aleksandra Rychlicka
Poland
Local time: 17:48
Russian to Polish
+ ...
Apr 25

OmegaT v 3.6.0_11


Most of the documents I translate are Polish->English/Russian.
I have huge glossaries, first created in xls and then converted into txt.

The problem is: Omega T finds words in my glossary ONLY if they're in the same grammatical case (nomin., accusa. etc.). In Polish the same word ends with ending when used in different grammatical cases.

I know it has already been discussed at some polish forums, but no solution has been found.


Please, guide me!

Aleksandra


 

Susan Welsh  Identity Verified
United States
Local time: 11:48
Member (2008)
Russian to English
+ ...
recognition of declensions Apr 25

I'm not an expert, but I do not have this problem with Russian declensions. OmegaT recognizes them almost perfectly. I suggest that you upgrade to the latest build: OmegaT 4.2.0. The older versions of the software required the attachment of Tokenizers to recognize declined forms, but this is no longer necessary.

If that doesn't work and nobody else answers here, ask on the Yahoo group, which has lots of experts: https://groups.yahoo.com/neo/groups/OmegaT/info


 

Aleksandra Rychlicka
Poland
Local time: 17:48
Russian to Polish
+ ...
TOPIC STARTER
Omega T 4.2.0 - stillno recognition (Polish) Apr 25

Thank you for your answer Susan!

Unfortunately OmegaT 4.2.0 has the same problem - I have installed it today.
I read at another OmegaT forum that the program has got no problem with Russwian or Czech declension, because of a plugin that somebody wrote to solve it. I'm trying to figure if there's any "plugin" or anything that would help with Polish declension.

Thank you for the yahoo link, I'll ask there also.

Aleksandra


 

esperantisto  Identity Verified
Local time: 18:48
Member (2006)
English to Russian
+ ...
Tokenizers Apr 25

Tokenizers are the answer. For some languages, dedicated tokenizers are available. If Polish is not among them, tokenization can be done using the respective Hunspell dictionary. Thus, check the project properties for the tokenizer enabled for your target language.

DZiW
 

Milan Condak  Identity Verified
Local time: 17:48
English to Czech
Tokenizers are the answer Apr 25

esperantisto wrote:

Tokenizers are the answer.


Edit your Project. I have Lucene and can choose HunTokenizer, too.

LucenePolishTokenizer
LuceneCzechTokenizer
--
Source PL sentence in Editor:

Aktywa z tytułu odroczonego podatku dochodowego
-
Glossary:

Podatki = daně
1. taxes

podatki = daně
1. taxes

podatku = daň
1. tax

skrawek = šrot
1. scrap

Tytułów = tituly
1. titles

tytułów = tituly
1. titles
===
Example of unwanted tokenization (lemmanisation):

http://www.condak.cz/nove/2019-04/18/en/02.html

A translation from Interlingua, word "universal". Wanted is exact match of whole word "universal".

Milan

[Edited at 2019-04-25 14:21 GMT]


Aleksandra Rychlicka
 

Aleksandra Rychlicka
Poland
Local time: 17:48
Russian to Polish
+ ...
TOPIC STARTER
It works! Apr 25

esperantisto wrote:

Tokenizers are the answer.

Thank you sooooo much! I had no idea what are the Tokenizers for - now I know, I have changed it to Polish and it did all the magic!

Aleksandra


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Omega T glossary - Declension / Deklinacja

Advanced search






SDL Trados Studio 2019 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2019 has evolved to bring translators a brand new experience. Designed with user experience at its core, Studio 2019 transforms how new users get up and running, helps experienced users make the most of the powerful features.

More info »
SDL MultiTerm 2019
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2019 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2019 you can automatically create term lists from your existing documentation to save time.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search