Statistical Language Modeling (SLM); Computational Linguistics (CL)
• Surface (s) and hidden (h) components of language
• The p(s,h) function
• Statistical language modeling: estimating p(s)
• Distribution of words, sentences, documents
• Computational linguistics / NLP: estimating p(h|s)
• Classification vs. Regression vs. Density estimation
• The source-channel model (aka, a Bayes classifier for everything)
• SLM used as prior: speech, translation, spelling correction, OCR,...
• SLM used as likelihood: document classification,...
• [Probability: prior, posterior, Bayes' theorem, Bayes classifier]
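
To make the source-channel idea concrete, here is a minimal sketch of spelling correction as a Bayes classifier: pick the hidden word h that maximizes p(h) · p(s|h), which by Bayes' theorem is proportional to the posterior p(h|s). Everything specific in it is an assumption for illustration and does not come from the slides: the toy corpus, the unigram language model used as the prior, and the hypothetical channel model that charges a fixed `error_rate` factor per edit (it is unnormalized, which is harmless for an argmax).

```python
from collections import Counter

# Toy training text (an assumption for illustration only).
corpus = "the cat sat on the mat the cat ate the rat".split()
counts = Counter(corpus)
total = sum(counts.values())


def prior(word):
    """Unigram language model p(h): relative frequency in the toy corpus."""
    return counts[word] / total


def edit_distance(a, b):
    """Levenshtein distance between strings a and b (dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute or match
        prev = cur
    return prev[-1]


def channel(observed, intended, error_rate=0.1):
    """Hypothetical channel model p(s|h), up to a constant:
    each edit multiplies the score by error_rate."""
    return error_rate ** edit_distance(observed, intended)


def decode(observed):
    """Source-channel decoding: argmax over h of p(h) * p(s|h)."""
    return max(counts, key=lambda h: prior(h) * channel(observed, h))


print(decode("thw"))  # candidate words are limited to the toy vocabulary
print(decode("tat"))  # several words are one edit away; the prior decides
```

In this sketch the language model plays the role of the prior, as in the speech, translation, spelling-correction, and OCR bullet. For document classification the roles swap: a per-class language model would supply the likelihood p(s|h), and the class frequencies would supply the prior.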