130 likes | 304 Views
Non-contiguity phrase-based SMT. Hendra Setiawan. Outline. A brief introduction to SMT Non-contiguity for SMT Near-future work. SMT P ( e | f ). Introduction to SMT. f: damoy Pat Riley rano pashol. e: Pat Riley went home early. Alignment describes translation.
E N D
Non-contiguity phrase-based SMT Hendra Setiawan
Outline • A brief introduction to SMT • Non-contiguity for SMT • Near-future work
SMT P(e|f) Introduction to SMT f: damoy Pat Riley rano pashol e: Pat Riley went home early Alignment describes translation
damoy Pat Riley rano pashol home Pat Riley early went damoy Pat Riley rano pashol Pat Riley went home early Introduction to SMT • Translation process: • Lexical mapping • Reordering (at word level) (at word level)
Introduction to SMT • Translation process: (state-of-the-art) • Lexical mapping (at phrase level) • Reordering (at phrase level) damoy Pat_Riley rano pashol home Pat_Riley early went damoy Pat_Riley rano pashol Pat_Riley went home early
Outline • A brief introduction to SMT • Non-contiguity for SMT • Near-future work
Contiguity to Non-contiguity (1) • Lexical mapping (training): • Can translate “Pat_Riley” to “Pat_Riley” but not “not” to “ne… pas” • Impose contiguity constraint when extracting “Pat_Riley”. • Given automatic alignment: only 40% coverage (max windows length = 7) e1e2…………...el fm ... .……… f2f1
Contiguity to Non-contiguity (2) • Relax the contiguity constraint • Increase coverage from 40% to 53% e1e2…………...el fm ... .……… f2f1
No-ordering (monotone) Unconstrained (permutation) Local-ordering ITG (80%) SpG (95%) e1/f1 em/fm e1/f1 em/fm Contiguity to Non-contiguity (3) • Reordering
Gap : PLMMG(e|f) x PLMMG(f|e)x PJSCM([e,f]f) x Plm(e)xPfe(o,g|[e,f]f)xPef(o,g|[e,f]e) Pharaoh Gap english point-of-view english point-of-view PLMMG(e|f), Plm(e), Pef(o,g|[e,f]e) PLMM(e|f), Plm(e), PD(e|f) foreign point-of-view foreign point-of-view PLMM(f|e) PLMMG(f|e),Pjscm([e,f]f), Pfe(o,g|[e,f]f) Model Comparison Pharaoh : PLMM(e|f) x PLMM(f|e) x Plm(e) x PD(e|f) Lexical Mapping Reordering
Some close testing result BLEU Model combination
Outline • A brief introduction to SMT • Non-contiguity for SMT • Near-future work
Near-future work • Parameter estimation and backoff strategy for bigram model • Perform open testing and analyze the result