Concordancing the DGT with lightning speed
Thread poster: xxx2nl

xxx2nl  Identity Verified
Netherlands
Local time: 11:47
Oct 2, 2014

Source: http://cafetran.wikidot.com/changelog

2014-09-29 Significant performance improvements in the H2 SQL database. New indexing system; existing tables won’t work. You need to delete the existing tables. Next, create a new table and import your data again. On an i3 laptop it takes 2-3 seconds to complete a search in 1.5 million TUs.

Within seconds, your gigantic database is queried and all matches are displayed:

https://www.youtube.com/watch?v=uhBS5MLNIBg

The database used in the demonstration movie contains 2 million TUs from the DGT. Note that 'Verordnung' and 'Kommission' are very frequent in the DGT, so collecting all hits takes more time.

More info: http://cafetran.wikidot.com/working-with-external-databases



FarkasAndras
Local time: 11:47
English to Hungarian
+ ...
I wonder... Oct 3, 2014

... how this compares to TMLookup (which uses SQLite without any added special sauce).
It does look very fast in the YouTube video. Care to do a comparison test?
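
For anyone who wants to try such a comparison, here is a minimal sketch of a timed concordance lookup in plain SQLite, using Python's standard sqlite3 module. It is not the actual code of TMLookup or CafeTran; the table name `tus`, the helpers `build_tm` and `concordance`, and the three sample TUs are all made up for illustration.

```python
import sqlite3
import time

# Hypothetical sample data standing in for DGT translation units (TUs).
SAMPLE_TUS = [
    ("Verordnung der Kommission", "Commission Regulation"),
    ("Die Kommission hat beschlossen", "The Commission has decided"),
    ("Artikel 1", "Article 1"),
]


def build_tm(tus):
    """Create an in-memory SQLite table and load (source, target) pairs."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE tus (source TEXT, target TEXT)")
    conn.executemany("INSERT INTO tus VALUES (?, ?)", tus)
    conn.commit()
    return conn


def concordance(conn, term):
    """Return all TUs whose source contains term, plus the elapsed seconds."""
    start = time.perf_counter()
    rows = conn.execute(
        "SELECT source, target FROM tus WHERE source LIKE ?",
        (f"%{term}%",),
    ).fetchall()
    return rows, time.perf_counter() - start


if __name__ == "__main__":
    conn = build_tm(SAMPLE_TUS)
    hits, seconds = concordance(conn, "Kommission")
    print(f"{len(hits)} hits in {seconds:.6f} s")
```

A LIKE pattern with a leading wildcard makes SQLite scan the whole table, so this is the "no special sauce" baseline. For a meaningful test you would load the same DGT data into both tools and search for the same terms, including very frequent ones.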



xxx2nl  Identity Verified
Netherlands
Local time: 11:47
TOPIC STARTER
Michael's at it Oct 3, 2014

Michael Beijer is doing some tests with his huge resources (45 GB). I'm quite sure he will report as soon as he's ready.


Meta Arkadia
Local time: 16:47
English to Indonesian
+ ...
MySQL Oct 3, 2014

FarkasAndras wrote:
...which uses sqlite without any added special sauce

Igor is going to add MySQL soon, and I suppose SQLite will be next. That would be a nice test, CT SQLite against TMLookup SQLite.

Cheers,

Hans



