170 likes | 315 Views
Applying Machine Translation Metrics to Student-Written Translations. Lisa N. Michaud Computer Science Department Merrimack College North Andover, Massachusetts, USA Patricia Ann McCoy Language Department Universidad de las Americas Puebla Puebla , Mexico. Criteria for judging translations.
E N D
Applying Machine Translation Metrics to Student-Written Translations Lisa N. MichaudComputer Science DepartmentMerrimack CollegeNorth Andover, Massachusetts, USA Patricia Ann McCoyLanguage DepartmentUniversidad de las AmericasPueblaPuebla, Mexico
Criteria for judging translations fluency(is it well-formed?) fidelity (does it convey original meaning?) (Hovy et al., 2002) Michaud and McCoy
Multiplicity of translations • In each one of these jobs the professor could have agreed to work 6 hours a day and therefore would not be surpassing the working day hour limit. • In each one of these jobs the teacher could have agreed to work 6 hours per day and therefore he wouldn't be bound by the limits of the working day. • In each of these examples the teaching could have been arranged so that he/she works six hours a day and would not be affected by any workday limitations. • In both of these jobs the professor could have agreed to work six hours daily and therefore he wouldn't be affecting his work shift limit. Michaud and McCoy
Multiplicity of translations • In each one of these jobs the professor could have agreed to work 6 hours a day and therefore would not be surpassing the working day hour limit. • In each one of these jobs the teacher could have agreed to work 6 hours per day and therefore he wouldn't be bound by the limits of the working day. • In each of these examples the teaching could have been arranged so that he/she works six hours a day and would not be affected by any workday limitations. • In both of these jobs the professor could have agreed to work six hours daily and therefore he wouldn't be affecting his work shift limit. Michaud and McCoy
BLEU Hypothesis Multiple References Michaud and McCoy
TERp Hypothesis INSERTION PHRASALEQUIVALENCE or SYNONYM SAMESTEM SUBSTITUTION SHIFT Single Reference Michaud and McCoy
TERp alignment and tags Michaud and McCoy
Student translation corpus Michaud and McCoy
Does TERp agree with an expert? Instructor Scores vs Inverted TERp 650 sentences (22%) Pearson Correlation r = 0.232236 Michaud and McCoy
Score distribution Michaud and McCoy
Instructor rubric (original) 10 Excellent 9 Good 8Satisfactory 0-7Deficient Michaud and McCoy
Evaluating TERp tags (pilot) Michaud and McCoy
Future work Michaud and McCoy
Instructor rubric (revised) 100 Excellent 90 Good 80 Satisfactory 0-70 Deficient Michaud and McCoy
Modifying the TERp Score Hypothesis INSERTION PHRASALEQUIVALENCE or SYNONYM SAMESTEM SUBSTITUTION SHIFT Single Reference Michaud and McCoy
Recognizing false cognates Hypothesis cynical Source cínico SUBSTITUTION brazen Single Reference Michaud and McCoy
Extracting mistranslation pairs SPANISHDICTIONARY ENGLISHDICTIONARY cynical cynicalbrazen cínico zona zone Michaud and McCoy