1 / 61

Statistical Machine Translation Part III – Phrase-based SMT / Decoding

Explore phrase-based translation, log-linear models, and automatic tuning for decoding in statistical machine translation. Learn about language models, complexity management, recombination, pruning, and future cost estimation.

rarrington
Download Presentation

Statistical Machine Translation Part III – Phrase-based SMT / Decoding

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Statistical Machine TranslationPart III – Phrase-based SMT / Decoding Alex Fraser Institute for Natural Language Processing University of Stuttgart 2008.07.23 EMA Summer School

  2. Outline • Phrase-based translation • Log-linear model • Tuning log-linear model • Decoding

  3. Slide from Koehn 2008

  4. Slide from Koehn 2008

  5. Language Model • Usually a trigramlanguage model isusedfor p(e) • P(the man wenthome) = p(the | START) p(man | START the) p(went | the man) p(home | man went) • Language modelswork well forcomparingthegrammaticalityofstringsofthesame length • However, whencomparingshortstringswithlongstringstheyfavorshortstrings • Forthisreason, a veryimportantcomponentofthelanguage model isthelengthbonus • Thisis a constant > 1 multipliedforeach English word in thehypothesis

  6. d Modified from Koehn 2008

  7. Slide from Koehn 2008

  8. Slide from Koehn 2008

  9. Slide from Koehn 2008

  10. Slide from Koehn 2008

  11. Slide from Koehn 2008

  12. Slide from Koehn 2008

  13. Slide from Koehn 2008

  14. Slide from Koehn 2008

  15. Slide from Koehn 2008

  16. Slide from Koehn 2008

  17. Slide from Koehn 2008

  18. Outline • Phrase-based translation • Log-linear model • Tuning log-linear model • Decoding

  19. Slide from Koehn 2008

  20. Slide from Koehn 2008

  21. Slide from Koehn 2008

  22. Slide from Koehn 2008

  23. Slide from Koehn 2008

  24. Slide from Koehn 2008

  25. Slide from Koehn 2008

  26. Slide from Koehn 2008

  27. Outline • Phrase-based translation model • Log-linear model • Tuning log-linear model automatically • Decoding

  28. Outline • Phrase-basedtranslation model • Log-linear model • Tuning log-linear model automatically • Decoding • Basic phrase-baseddecoding • Dealingwithcomplexity • Recombination • Pruning • Future costestimation • Decoding output

  29. Slide from Koehn 2008

  30. Slide from Koehn 2008

  31. Slide from Koehn 2008

  32. Slide from Koehn 2008

  33. Slide from Koehn 2008

  34. Slide from Koehn 2008

  35. Slide from Koehn 2008

  36. Slide from Koehn 2008

  37. Slide from Koehn 2008

  38. Slide from Koehn 2008

  39. Slide from Koehn 2008

  40. Slide from Koehn 2008

  41. Slide from Koehn 2008

  42. Slide from Koehn 2008

  43. Slide from Koehn 2008

  44. Slide from Koehn 2008

  45. Slide from Koehn 2008

  46. Slide from Koehn 2008

  47. Slide from Koehn 2008

  48. Slide from Koehn 2008

  49. Slide from Koehn 2008

  50. Slide from Koehn 2008

More Related