1 / 18

Evaluation of the Statistical Machine Translation Service for Croatian-English

Evaluation of the Statistical Machine Translation Service for Croatian-English. Marija Brkić Department of Informatics, University of Rijeka mbrkic@uniri.hr Tomislav Vičić Freelance teacher of economics and translator ssimonsays@gmail.com Sanja Seljan Department of Information Sciences

nitesh
Download Presentation

Evaluation of the Statistical Machine Translation Service for Croatian-English

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Evaluation of the Statistical Machine Translation Service for Croatian-English Marija Brkić Department of Informatics, University of Rijeka mbrkic@uniri.hr Tomislav Vičić Freelance teacher of economics and translator ssimonsays@gmail.com Sanja Seljan Department of Information Sciences Faculty of Humanities and Social Sciences, University of Zagreb sanja.seljan@ffzg.hr

  2. Machine translation Syntactic transfer Example-based translation Statistically-based translation Evaluation Manual Automatic Experimental study Google Translate Service (Croatian → English) Comparison and analysis Manual evaluation Conclusion Outline

  3. I.Machine translation basics Speeding uptranslation process Limited human component Multilingual access to written material Limited capabilities Help in discovering general idea behind text For limited use only

  4. Approaches Word-for-word Syntactic transfer* Interlingua Controlled language Example based* Statistically based* Various combinations

  5. Syntactic transfer Involves some linguistic rules Analyzes sourcesand translates using intermediary linguistic representations Usage still limited for particular purposes (i.e. scientific, marketing, etc.) Examples: Systran, Eurotra, Metal

  6. Example-based Uses blocks of words (example sentences) Utilizes analogy principle Needs to be fed with info System “learns” during augmenting stage Suitable for structurallycompletely different languages Example: translation memories

  7. Statistically-based (a.k.a. SMT) Utilizes statistical models Parameters derived from bilingual corpora Phrases as n-grams (n is number of terms in a phrase) Requires vast quantities of matched bilingual texts Outputsmost likely match inputs Does not applylinguistic rules Attempts to match language patterns 

  8. Problems Modeling / Learning / Decoding Approaches Word-based / Phrase-based / Syntax-based Example Google Translate Service Statistically based (cont.) 

  9. Manual Humanbilingual or monolingual evaluators score outputs according to fluency (grammar) and adequacy (preservation of information) Time-consuming, expensive and very subjective Automatic(BLEU, METEOR, etc.) Reference translations Goalhigher degree of correlation with human judgements II. Evaluation

  10. III. Experimental study Croatian – English “Very odd couple” A lot of systematic, idiosyncratic and lexical differences Three types of texts: Corpus linguistics, annotation and research methods Enterprises and Government's reform plan Washing machine manual

  11. Google translate web UI

  12. SMT Offers Croatian as source and target language Statistically based Monolingual target language texts Aligned texts (human translations) Fluency and adequacy highly depend on available corpora

  13. Reference translations vs. candidate translations Levels of analysis: Lexical (misuse of words, zerotones) Morphological (wrong word forms) Syntactic (word order) Semantic (preservation of original message) Usage of punctuation marks Task

  14. Manual Evaluation Procedure 6 bilingual evaluators and 21 sentences (machine translation and reference translation) 1 – 5 scale Fluency Adequacy 1 incomprehensible none 2 disfluent English little meaning 3 non - native English much meaning 4 good English most meaning 5 flawless English all meaning

  15. Hypotheses There is no significant difference in assigningscore 3 according to both criteria (fluency and adequacy). There are no significant differences in assigning score 3 to fluency and adequacy per each evaluator. Chi-square Test (2)

  16. There is no significant difference in assigning score 3 according to both criteria. There are no significant differences in assigning score 3 to fluency and adequacy per each evaluator. There are significant differences in assigning score 3 to fluency and adequacy for half of the evaluators. Results Fluency and adequacy per average judgements 4 Fluency 2 Adequacy 0 1st 2nd 3rd 4th 5th 6th

  17. Usage: Basic information transfer Personal use only Improvements: Integration with language dependent modules Human post-editing Greater number of evaluators needed IV. Conclusion

  18. Thank you on your attention!

More Related