Thai-English MT Project: Transfer Module




  1. Thai-English MT Project: Transfer Module • Prachya Boonkwan (IPA: /pratʃəˑjaː bunˑkʰʷan/) • NECTEC, Thailand

  2. Outline • Introduction • Analysis Module • Transfer Module • Generation Module • Conclusion CERDEC, NJ

  3. Introduction • Thai-English Machine Translation: Thai Sentence → Thai Analysis → Thai-to-English Transfer → English Generation → English Sentence

  4. Introduction (cont’d) • Characteristics of Thai • Analytic language • Subject-Verb-Object pattern • Words written consecutively without space • Serial verb construction • No articles and no mass/concrete classification • Use of classifiers • Auxiliary words to express number, voice, tense, and aspect

  5. Introduction (cont’d) • Issues of Thai-English translations • Summarized from Monthika’s observation • Different orderings between Thai and English • Different verb arguments between Thai and English • Implicit relations in Thai serial noun construction • Semantic duplication in Thai serial verb construction • No plural inflection in Thai • No inflection to express voices, tenses, and aspects in Thai

  6. Introduction (cont’d) • Issue 1: Different orderings between Thai and English

  7. Introduction (cont’d) • Issue 2: Different verb arguments between Thai and English
     chãn/PR pai/V fràngsès/N
     Lit: I/PR go/V France/N
     Trans: I/PR go/V to France/N
     phaanrowng/N1 yùt/V1 sùub/V2 bùrìi/N2
     Lit: janitor/N1 stop/V1 smoke/V2 cigarette/N2
     Trans: The janitor/N1 stopped/V1 smoking/V2 cigarette/N2

  8. Introduction (cont’d) • Issue 3: Implicit relations in Thai serial noun construction

  9. Introduction (cont’d) • Issue 4: Semantic duplication in Thai serial verb construction
     raayngaan/V1 hâi/P sâab/V2
     Lit: report/V1 to/P know/V2
     Trans: report/V1 (to know/V2)
     khâa/V1 hâi/P taay/V2
     Lit: kill/V1 to/P die/V2
     Trans: kill/V1 (to die/V2)
     phûut/V1 hâi/P fang/V2
     Lit: tell/V1 to/P listen/V2
     Trans: tell/V1 (to listen/V2)

  10. Introduction (cont’d) • Issue 5: No plural inflection in Thai

  11. Introduction (cont’d) • Issue 6: No inflection to express voices, tenses, and aspects in Thai

  12. Analysis Module • Overview of Analysis Module: Thai Sentence → Thai Word Segmentor → List of words and POSes (w1 w2 w3) → Thai Parser (with Thai Grammar) → Thai Dep. Tree
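
The slides do not say how the Thai Word Segmentor works. Since Thai is written without spaces between words, one common baseline is dictionary-based longest matching, and the sketch below illustrates that baseline only. The function name `segment` and the toy dictionary are assumptions made for illustration; the input string is assembled from words shown on the parser slide.

```python
def segment(text, dictionary):
    """Greedy longest-match segmentation (a baseline, not the project's method)."""
    longest = max(len(word) for word in dictionary)
    words, i = [], 0
    while i < len(text):
        # Try the longest candidate first, shrinking until the dictionary matches.
        for size in range(min(longest, len(text) - i), 0, -1):
            if text[i:i + size] in dictionary:
                words.append(text[i:i + size])
                i += size
                break
        else:
            # Unknown character: emit it alone so the loop always advances.
            words.append(text[i])
            i += 1
    return words


# Hypothetical toy dictionary built from words on the parser slide.
TOY_DICT = {"แบลร์", "ส่ง", "ทหาร", "อังกฤษ", "ไป", "อิรัก"}
print(segment("แบลร์ส่งทหารอังกฤษไปอิรัก", TOY_DICT))
```

Greedy matching can pick the wrong boundary when dictionary words overlap, which is one reason a real segmenter would likely need more than this.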

  13. Analysis Module (cont’d) • Thai Parser: input and output • Dependency-tree nodes (word, gloss, POS): ส่ง (send, VT); ทหาร (soldier, N); อิรัก (Iraq, N); ใน (in, P); แบลร์ (Blair, N); ไป (to, P); ปี 2003 (2003, N); อังกฤษ (Britain, N)

  14. Transfer Module • Overview of Transfer Module: Thai Dep. Tree → Thai-Eng Transformation (with Mapping Tables) → English Dep. Tree → Leaf-Node Collection → List of lemmas annotated with syntactic attributes (w1 w2 w3)

  15. Transfer Module (cont’d) • Attributes of Thai nouns

  16. Transfer Module (cont’d) • Attributes of Thai verb

  17. Transfer Module (cont’d) • Transfer operations

  18. Transfer Module (cont’d) • Reordering (R) • Relocates constituents, resulting in a quasi-English dependency tree • Attribute assignment (A) • Assigns English syntactic attributes to the quasi-English tree • Insertion (I) & Deletion (D) • Inserts/deletes constituents in the quasi-English dependency tree, resulting in the English tree
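
The four operations can be pictured as small edits on a dependency tree. The following is a minimal sketch, not the project's implementation: the `Node` class, its `left`/`right` dependent lists, and the function names are all assumptions. The rule comments reuse the notation from the demonstration slides, read here as "the angle bracket points at the head".

```python
from dataclasses import dataclass, field


@dataclass(eq=False)  # identity comparison, so list.index finds the exact node
class Node:
    lemma: str
    pos: str
    attrs: set = field(default_factory=set)
    left: list = field(default_factory=list)   # dependents before the head
    right: list = field(default_factory=list)  # dependents after the head


def reorder(head, dep_pos):
    """R: move right-hand dependents with POS dep_pos in front of the head."""
    moved = [d for d in head.right if d.pos == dep_pos]
    head.right = [d for d in head.right if d.pos != dep_pos]
    head.left = moved + head.left


def assign(node, attr):
    """A: attach a syntactic attribute such as 'plu' to a node."""
    node.attrs.add(attr)


def insert_above(head, dep, lemma, pos):
    """I: put a new node between head and its right-hand dependent dep."""
    head.right[head.right.index(dep)] = Node(lemma, pos, right=[dep])


def delete(head, dep_pos):
    """D: drop leaf dependents of head that carry POS dep_pos."""
    def keep(d):
        return d.pos != dep_pos or d.left or d.right
    head.left = [d for d in head.left if keep(d)]
    head.right = [d for d in head.right if keep(d)]


dog = Node("dog", "N", right=[Node("big", "ADJ")])
reorder(dog, "ADJ")                  # N < ADJ --> ADJ > N {R}
assign(dog, "plu")                   # N --> N[+plu] {A}

me = Node("I", "PR")
bark = Node("bark", "V", left=[dog], right=[me])
insert_above(bark, me, "at", "P")    # bark < PR --> bark < (at < PR) {I}

three = Node("three", "NUM", right=[Node("CL", "CL")])
delete(three, "CL")                  # NUM < CL --> NUM {D}
```

Keeping left and right dependents in separate lists makes reordering a list move rather than a position calculation, which is why the sketch uses that layout.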

  19. Transfer Module (cont’d) • Transfer operations classified into groups

  20. Transfer Module (cont’d) • Steps of transfer operations: Thai Dep. Tree → Reordering → Quasi-English Dep. Tree → Attribute assignment → Insertion & Deletion → English Dep. Tree
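
Running the operations in stages matters because later rules can depend on earlier ones. In the demonstration slides, the deletion rule `PROG > V[+prog] --> V[+prog] {D}` only matches after the assignment rule `PROG > V --> PROG > V[+prog] {A}` has fired. The sketch below shows one way to stage the rules; the `Node` class, the rule functions, and the `STAGES` table are hypothetical, and the reordering stage is left empty for brevity.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class Node:
    lemma: str
    pos: str
    attrs: set = field(default_factory=set)
    deps: list = field(default_factory=list)  # dependents of this node


def walk(node):
    """Yield every node of the tree, head before dependents."""
    yield node
    for dep in node.deps:
        yield from walk(dep)


def mark_progressive(node):
    # PROG > V --> PROG > V[+prog] {A}
    if node.pos == "V" and any(d.pos == "PROG" for d in node.deps):
        node.attrs.add("prog")


def drop_progressive(node):
    # PROG > V[+prog] --> V[+prog] {D}
    if node.pos == "V" and "prog" in node.attrs:
        node.deps = [d for d in node.deps if d.pos != "PROG"]


# Assumed stage order: reordering, attribute assignment, insertion/deletion.
STAGES = [("R", []), ("A", [mark_progressive]), ("I/D", [drop_progressive])]


def transfer(root, stages=STAGES):
    """Apply each stage's rules to every node before moving to the next stage."""
    for _name, rules in stages:
        for rule in rules:
            # Snapshot the nodes first, because rules may edit the tree.
            for node in list(walk(root)):
                rule(node)
    return root


bark = Node("bark", "V", deps=[Node("PROG", "PROG")])
transfer(bark)
print(bark.attrs, [d.pos for d in bark.deps])
```

If the two stages were swapped, the deletion rule would find no `prog` attribute on its first pass and the auxiliary marker would survive into the English tree.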

  21. Transfer Module (cont’d) • Graphical notations: tree pattern • Arc types: only one depth; any depth • Example patterns: W1 < *W2; (*W3 > (*W4 > *W1)) < *W2

  22. Transfer Module (cont’d) • Graphical notation: transfer operation • Example (operation R): (N > V) < *ADV --> N > (*ADV > V) {R}
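
The textual form of a rule, `pattern --> pattern {operation}`, is regular enough to split mechanically. The sketch below only separates the three parts and does not interpret the tree patterns themselves; the regular expression and the function name `parse_rule` are assumptions, not part of the project.

```python
import re

# left pattern, "-->", right pattern, then the operation letter in braces
RULE = re.compile(
    r"^\s*(?P<lhs>.+?)\s*-->\s*(?P<rhs>.+?)\s*\{(?P<op>[RAID])\}\s*$"
)


def parse_rule(text):
    """Split a rule string into (left pattern, right pattern, operation)."""
    match = RULE.match(text)
    if match is None:
        raise ValueError(f"not a transfer rule: {text!r}")
    return match.group("lhs"), match.group("rhs"), match.group("op")


print(parse_rule("(N > V) < *ADV --> N > (*ADV > V) {R}"))
print(parse_rule("DET > N[+plu]--> DET[+plu] > N[+plu]{A}"))
```

The optional whitespace around `-->` and before the braces lets the parser accept the tightly set rules as they appear on the demonstration slides.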

  23. Transfer Module (cont’d) • Demonstration

  24. Transfer Module (cont’d) • Demonstration (cont’d) • N < ADJ --> ADJ > N {R}

  25. Transfer Module (cont’d) • Demonstration (cont’d) • N < NUM --> NUM > N {R}

  26. Transfer Module (cont’d) • Demonstration (cont’d) • N < DET --> DET > N {R}

  27. Transfer Module (cont’d) • Demonstration (cont’d) • NUM > N --> NUM > N[+plu] {A}

  28. Transfer Module (cont’d) • Demonstration (cont’d) • DET > N[+plu] --> DET[+plu] > N[+plu] {A}

  29. Transfer Module (cont’d) • Demonstration (cont’d) • V < PR --> V < PR[+acc] {A}

  30. Transfer Module (cont’d) • Demonstration (cont’d) • PROG > V --> PROG > V[+prog] {A}

  31. Transfer Module (cont’d) • Demonstration (cont’d) • bark < PR --> bark < (at < PR) {I}

  32. Transfer Module (cont’d) • Demonstration (cont’d) • NUM < CL --> NUM {D}

  33. Transfer Module (cont’d) • Demonstration (cont’d) • CL > ADJ --> ADJ {D}

  34. Transfer Module (cont’d) • Demonstration (cont’d) • PROG > V[+prog] --> V[+prog] {D}

  35. Transfer Module (cont’d) • Demonstration (cont’d) • Leaf-Node Collection • Result: These three big dogs are barking at me.
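
Collecting the nodes of the final English tree in surface order can be sketched as an in-order walk: left dependents, then the head, then right dependents. The tree below is a reconstruction from the demonstration rules and the final sentence, since the figure itself is not in the transcript; the `Node` class and the `collect` function are assumptions for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    lemma: str
    attrs: tuple = ()
    left: list = field(default_factory=list)   # dependents before the head
    right: list = field(default_factory=list)  # dependents after the head


def collect(node):
    """In-order walk: left dependents, the head itself, then right dependents."""
    out = []
    for dep in node.left:
        out.extend(collect(dep))
    out.append(node.lemma + "".join(f"[+{a}]" for a in node.attrs))
    for dep in node.right:
        out.extend(collect(dep))
    return out


# Assumed shape of the English tree after all demonstration rules have applied.
dogs = Node("dog", ("plu",),
            left=[Node("this", ("plu",)), Node("three"), Node("big")])
tree = Node("bark", ("prog",),
            left=[dogs],
            right=[Node("at", right=[Node("I", ("acc",))])])

print(" ".join(collect(tree)))
# this[+plu] three big dog[+plu] bark[+prog] at I[+acc]
```

The output is a list of lemmas with attributes rather than a finished sentence; turning `this[+plu]` into "These" or `I[+acc]` into "me" is left to the generation module.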

  36. Generation Module • Overview of generation module • Input: List of lemmas annotated with syntactic attributes (w1 w2 w3) • Components: Lemma Mapping Tables; Discourse Stack DB; Article Insertion; Surface Word Generation • Intermediate data: E1 E2 E3; W’1 W’2 W’3; Surface Word List • Output: English Output Sentence

  37. Conclusion • Thai-English Machine Translation: Thai Sentence → Thai Analysis → Thai Dep. Tree → Thai-to-English Transfer → List of lemmas annotated with syntactic attributes (w1 w2 w3) → English Generation → English Sentence

  38. Conclusion (cont’d) • Issues of Thai-English translation • Attributes of Thai lexical units • Generalized transfer operations • Reordering • Attribute assignment • Insertion • Deletion

  39. Upcoming Work • Prologizing transfer operations • Generation Module
