The Basics of Quality Estimation
Thread poster: Juan Martín Fernández Rowda

Juan Martín Fernández Rowda  Identity Verified
United States
Local time: 00:15
English to Spanish
+ ...
Sep 1, 2016

Sharing one of my posts recently published on the GALA blog for those interested in quality estimation:

https://www.gala-global.org/blog/basics-quality-estimation#sthash.Lzyi78eK.Dj7Ydxt5.dpbs


 

Samuel Murray  Identity Verified
Netherlands
Local time: 09:15
Member (2006)
English to Afrikaans
+ ...
@Juan Sep 1, 2016

Juan Martín Fernández Rowda wrote:
Sharing one of my posts recently published on the GALA blog for those interested in quality estimation:
https://www.gala-global.org/blog/basics-quality-estimation#sthash.Lzyi78eK.Dj7Ydxt5.dpbs


So, if I understand correctly, the QE designer would figure out which things make a sentence less likely to be translated well, based on the language combination. Right?

So for example, if it is known that sentences in Upper Slobovian that contain many nouns will likely produce a poor translation in Mid Slobovian but likely produce a reasonably accurate translation in Lower Slobovian, then such sentences would be graded with a low QE score if the target language is Mid Slobovian (which means that they'll go to the top of the pile of segments to be checked by a human) but a high QE score if the target language is Lower Slobovian (which means they'll move to the bottom of that pile). Do I understand this correctly?

So QE is not an evaluation of an existing translation by comparing it with a human translation, but a guess about the quality of the translation based on the elements of sentences in a given language combination. Yes?


 

Juan Martín Fernández Rowda  Identity Verified
United States
Local time: 00:15
English to Spanish
+ ...
TOPIC STARTER
@ Samuel Sep 1, 2016

Samuel Murray wrote:

So, if I understand correctly, the QE designer would figure out which things make a sentence less likely to be translated well, based on the language combination. Right?

So for example, if it is known that sentences in Upper Slobovian that contain many nouns will likely produce a poor translation in Mid Slobovian but likely produce a reasonably accurate translation in Lower Slobovian, then such sentences would be graded with a low QE score if the target language is Mid Slobovian (which means that they'll go to the top of the pile of segments to be checked by a human) but a high QE score if the target language is Lower Slobovian (which means they'll move to the bottom of that pile). Do I understand this correctly?

So QE is not an evaluation of an existing translation by comparing it with a human translation, but a guess about the quality of the translation based on the elements of sentences in a given language combination. Yes?



Thanks for your interest, Samuel. Here's a very simplistic explanation: QE is based on language combination, yes. Now, the QE designer, so to call it, is usually a scientist - not a lot of linguists can deal with this. The main reason is that it's a complex process. There are standard set of features (the features are what indicate the potential presence of an issue in a sentence) and there are more specific ones. Knowing which features you want to use, a QE model is created using supervised machine learning on parallel data (source, target). So these features are "learned" from already translated text. That model can then be create to estimate the quality of other texts for which you don't have a reference translation.

What you say is correct - QE is only a prediction of the quality, not an evaluation of it. However, we always need to keep in mind that with language, 1+1 is not always 2. I did some work on quality estimation based only on linguistic features, with the sole purpose of making QE more accessible to translators and linguists. I think I posted links in this forum.

If you want to learn more, let me recommend some papers:
Machine Translation Quality Estimation Across Domains - http://www.aclweb.org/anthology/C14-1040
Confidence estimation for Machine Translation - http://web.eecs.umich.edu/~kulesza/pubs/confest_coling04.pdf
Estimating the Sentence-Level Quality of Machine Translation Systems - http://www.mt-archive.info/EAMT-2009-Specia.pdf


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


The Basics of Quality Estimation

Advanced search







CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »
SDL Trados Studio 2017 only €435 / $519
Get the cheapest prices for SDL Trados Studio 2017 on ProZ.com

Join this translator’s group buy brought to you by ProZ.com and buy SDL Trados Studio 2017 Freelance for only €435 / $519 / £345 / ¥63000 You will also receive FREE access to Studio 2019 when released.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search