mediamatrix wrote:
Thanks for your comments, Jeff. I hesitate to differ with an acknowleged expert in MT but, for what it's worth, here are just a few of the 'inconsistencies' I found on a first reading of the questionnaire, and now confirmed on a second and third reading, and having looked (again) at
www.eamt.org:[/quote] mediamatrix wrote:
Q3 Options A, B and C refer to CAT, not to MT as such.
agata wrote in the questionnaire:
3. Which of the following statements, in your opinion, describes the notion of machine translation best?
A. machine translation remains a useful facility that substitutes humans in the task of translating
B. machine translation remains an effective facility only with the assistance of human translators
C. machine translation offers practical help for the translator but does not complete the task of translation
I recommend reading the following posts that I have previously made on this topic:
Definition of CAT
http://www.proz.com/post/184880#184880 &
MT is part of CAT & MT postediting is worthwhile
http://www.proz.com/post/189873#189873 mediamatrix wrote:
Q4 Option D is a 'writing aid' and not, in itself, an MT or even a CAT tool.
The other options may be used in CAT or MT, with entirely different results in each case.
agata wrote in the questionnaire:
4. Which of the following MT aids do you use in your everyday life?
A. on-line dictionaries
B. term banks
C. glossaries
D. word processors
E. translation workstations
F. others:……………
This question does not seem to ask if these are MT tools or CAT tools, but rather if they are used as aids to MT.
On-line dictionaries are used for building MT dictionaries. Same with term banks and glossaries. Word processors are typically used for MT, and I specifically stated in my book review (
http://www.multilingual.com/allen46.htm) concerning a book on the topic of MT postediting published in 2001 that it referred to MT processing and postediting techniques based on paper evaluation processes that were used in the late 1980s and early 1990s.
Even in the mid-1990s at Caterpillar, many of the translators using the MT systems did their assessment on print-out copies of the documents.
Translation Workstations have been part of the discussion of MT usage for many years. See:
http://cslu.cse.ogi.edu/HLTsurvey/ch8node6.html http://ourworld.compuserve.com/homepages/WJHutchins/Conchist.htm http://www.aymara.org/biblio/mtranslation.pdf http://acl.ldc.upenn.edu/X/X96/X96-1024.pdf http://ethesis.helsinki.fi/julkaisut/hum/yleis/vk/hoge/towardsa.pdf mediamatrix wrote:
Q8 Options A and B are MT (by EAMT's definitions) and Option C is CAT.
agata wrote in the questionnaire:
8. What is the way you use machine translation?
A. machine translation as a translation aid for full-quality human translation
B. raw translation for information scanning
C. raw translation as a drafting aid, i.e. use of MT for translating rough versions of a text from the target language into the source language
I specifically described the difference between Content gisting (inbound translation, also referred to as assimilation in MT circles) and translation for publication (outbound Translation, also referred to as dissemination in MT circles) in the following:
Inbound vs Outbound MT systems and software
http://www.proz.com/post/190126#190126 And a description of each level of working with the MT output (known as MT postediting) is in my book chapter on the topic:
Post-editing. In Computers and Translation: A Translators Guide. Benjamins Translation Library, Number 35.
http://www.geocities.com/mtpostediting/PE-book-chapter-jeff-allen.pdf mediamatrix wrote:
Q9 Option A - The definition of pre-editing is incorrect (pre-editing is the adaptation of the source text to increase the likelihood of it being undertood correctly by the MT system).
agata wrote in the questionnaire:
9. Which of the following techniques do you use for revision of translation?
A. pre-editing, i.e. checking the input text and adjusting it to satisfy the expectations of its receiver
On this point, your definition is closer to the one normally used. Since 2001, I have also started to refer to MT dictionary building as a pre-editing (or rather an MT pre-processing) step which can be done before MT processing or in parallel with it.
mediamatrix wrote:
Q10 greatest advantage? - compared to what?
On this point, the answers show that she means "what are the key diffenciators, or selling points, for using MT in your context?"
mediamatrix wrote:
Q11 effective? - in terms of what? (cost, time, accuracy, ROI, ....)
Yes, objectives, targets, and measurements need to be set. In many cases, as I have written in a couple of articles, these are not set. Two of my current articles in progress and nearly ready to be submitted will deal with these issues in more detail.
mediamatrix wrote:
Q14/Q15 what about 'flagrant errors of comprehension' ?
It appears that she was taking the assessment criteria from standards like J2450 or Blackjack. I agree that additional criteria, like the one you mentioned, could be added. However, the criteria she listed are objective, measurable items.
agata wrote in the questionnaire:
15. Which of them affect the quality of translation most?
A. grammatical errors
B. lexical errors
C. stylistic errors
D. punctuation errors
E. omissions
F. incompleteness
"Flagrant errors of comprehension" is in itself a subjective evaluation-based statement, which can be determined based on assessing a combination of the measurable points she listed in that question. I understand that what you are trying to say is "this is junk output that I don't understand". But how do we evaluate "excellent translation" versus "very good translation" versus "good translation" versus "acceptable attempt" versus "half-way understandable" versus "total junk"? It is difficult to establish a value level for subjective criteria and use it consistently, unless such an item can be calculated from other measurable sub-criteria.
Thanks for your feedback and discussion on this topic. It is encouraging to see that you have taken the time to look at the questionnaire, and carefully consider it compared with other information available elsewhere on this general topic.
This also shows that surveys need to be carefully created and presented. My experience in creating and administering about 10 different language technology surveys confirms that they more time and effort invested up front to making the survey understandable and easy-to-fill-in will make the survey/questionnaire more valuable for gathering accurate and useful responses.
I hope that Agata can benefit from your comments.
Jeff
[Edited at 2006-04-16 22:49]