
Giant(?) Multiterm Databases: Are they feasible?
Thread poster: Haluk Levent Aka
Haluk Levent Aka
Local time: 07:28
Japanese to Turkish
+ ...
Aug 8, 2004

I'm thinking about migrating most of my printed dictionaries/glossaries into MultiTerm format. When done, each termbase will contain between 5,000 and 15,000 terms.

Are such large termbases practical? I'm afraid that working with termbases this large may slow down the translation process (a lag between closing and opening segments).

Does anyone have experience working with termbases of this size? Also, can anyone tell me what the actual size (in MB or GB) of a termbase with, say, 1,000 terms would be?

Thanks & Regards,


Harry Bornemann  Identity Verified
English to German
+ ...
They are feasible Aug 8, 2004

I converted all of the MS glossaries into one MultiTerm database, using a blindingly fast Perl program for the conversion and MS Access to clean up the result (the source contains a lot of garbage).

This MultiTerm database contains 217,058 entries and takes up 148 MB (14 MB zipped). It does not really slow down translation, because the TM search runs independently of the MultiTerm search, which takes just one or two seconds longer than the TM search.

I hate to admit that this is much faster than the Déjà Vu search in the same converted MS glossary. That search takes too long for Déjà Vu to insert the terms into the translation automatically, but it is still fast enough if you disable automatic insertion and use it like MultiTerm. Used this way, it takes 3-6 seconds, which gives you a good chance of seeing the hits before you have finished the segment...

Maybe it would be faster if I converted it into a Déjà Vu Lexicon instead of a Terminology Database. I have not tested this yet, because I think I would have to clean up the database much more before it would be suitable for a Lexicon.

Good speed,

[Edited at 2004-08-08 23:51]


Victor Sidelnikov  Identity Verified
Russian Federation
Local time: 07:28
Member (2004)
English to Russian
+ ...
No problems Aug 9, 2004

I have some termbases with 130,000 terms, 60-70 MB in size. No slowdown and no failures.
Broadly speaking, 5,000-15,000 terms is the usual size of a dictionary; you will not even notice the search in a termbase of that size.
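
The original question about the size of a 1,000-term termbase can be roughly estimated from the figures reported in the two replies above. The sketch below is only a back-of-the-envelope calculation: it assumes that file size scales linearly with the number of entries and ignores any fixed overhead, and the function names are made up for illustration. Real sizes will depend on how many fields and languages each entry holds.

```python
# Figures reported in this thread: (number of entries, size in MB).
REPORTS = [
    (217_058, 148.0),  # converted MS glossaries
    (130_000, 60.0),   # 130,000-term termbase, lower reported size
    (130_000, 70.0),   # 130,000-term termbase, upper reported size
]


def kb_per_entry(entries: int, size_mb: float) -> float:
    """Average size of one entry in KB (assumes 1 MB = 1024 KB)."""
    return size_mb * 1024 / entries


def estimate_mb(entries: int, per_entry_kb: float) -> float:
    """Estimated termbase size in MB, assuming purely linear scaling."""
    return entries * per_entry_kb / 1024


rates = [kb_per_entry(n, mb) for n, mb in REPORTS]
low, high = min(rates), max(rates)

for n in (1_000, 5_000, 15_000):
    print(f"{n:>6} terms: about {estimate_mb(n, low):.1f} to {estimate_mb(n, high):.1f} MB")
```

Under these assumptions, 1,000 terms would come to roughly 0.5 to 0.7 MB, so even a 15,000-term termbase would stay far below the sizes that the posters above report working with comfortably.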

Haluk Levent Aka
Local time: 07:28
Japanese to Turkish
+ ...
Thanks for all replies Aug 11, 2004

Thank you for your replies and comments. I'm much relieved that termbases with 5 to 15 thousand entries will not cause delays.

