Anton Konashenok wrote:
Wow. A TM containing several million entries of unknown origin, unknown quality and not even guaranteed to be translations, merely parallel texts, automatically crawled and unlikely to have been reviewed by a human. How quaint.
Am going to add it to my huge TMX collection in Memsource!
If the word "collection" meant a cabinet of curiosities, it would certainly make an interesting exhibit. But a useful working tool? Don't think so. In fact, publicly admitting to using it for translation work is a reputational risk – potential clients may consider it unscrupulous.
I have no problem admitting I use massive TMs found online. I also use machine translation. And a mouse. What else is new?
Of course there may be lots of noise in these huge databases, but also lots of little treasures. Have you ever run a concordance search? It provides you with a nice list of possible translations. The art is of course to be able to separate the wheat from the chaff, which I am actually quite good at.
I obviously don't just accept high fuzzy matches from these massive TMs, willy nilly, and race on to the next segment. In fact, high fuzzy matches hardly ever occur between texts in everyday work. What I mainly use these databases for is to find difficult terms, or at least pointers in the right direction. Because of their size (altogether my TM collection contains currently around 50,000,000 TUs), you are often quite likely to run into the term you're having trouble with. You can then use this to do further research, although what you find in your first search will often already be correct.
Anyway, as with any tool, you need to know how to use it.