Concordancing the DGT with lightning speed
Thread poster: xxx2nl

xxx2nl  Identity Verified
Netherlands
Local time: 01:12
Oct 2, 2014

Source: http://cafetran.wikidot.com/changelog

2014-09-29 Significant performance improvements in the H2 SQL database. New indexing system; existing tables won’t work. You need to delete the existing tables. Next, create a new table and import your data again. On an i3 laptop it takes 2-3 seconds to complete a search in 1.5 million TUs.

Within seconds, your gigantic database is queried and all matches are displayed:

https://www.youtube.com/watch?v=uhBS5MLNIBg

The database used in the demonstration movie contains 2 million TUs from the DGT. Note that 'Verordnung' and 'Kommission' are very frequent in the DGT, so collecting all hits takes more time.

More info: http://cafetran.wikidot.com/working-with-external-databases


 

FarkasAndras
Local time: 01:12
English to Hungarian
+ ...
I wonder... Oct 3, 2014

... how this compares to TMLookup (which uses sqlite without any added special sauce).
It does look very fast on the youtube video. Care to do a comparison test?


 

xxx2nl  Identity Verified
Netherlands
Local time: 01:12
TOPIC STARTER
Michael's at it Oct 3, 2014

Michael Beijer is doing some tests with his huge resources (45 GB). I'm quite sure he will report as soon as he's ready.

 

Meta Arkadia
Local time: 06:12
English to Indonesian
+ ...
MySQL Oct 3, 2014

FarkasAndras wrote:
...which uses sqlite without any added special sauce

Igor is going to add MySQL soon, and I suppose SQLite will be next. That would be a nice test, CT SQLite against TMLookup SQLite.

Cheers,

Hans


 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Natalie[Call to this topic]

You can also contact site staff by submitting a support request »

Concordancing the DGT with lightning speed

Advanced search






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search