320 likes | 326 Views
This chapter discusses the differences between generative and discriminative classifiers, specifically focusing on the Perceptron model and Hidden Markov Models (HMMs). It explores the use of HMMs in reasoning over time and introduces concepts such as conditional independence, stationary distributions, and the Forward and Viterbi algorithms. Additionally, it discusses the application of HMMs in speech recognition, robot localization, and web link analysis.
E N D
CHAPTER 8 DISCRIMINATIVE CLASSIFIERSHIDDEN MARKOV MODELS
Reasoning over Time • Often, we want to reason about a sequence of observations Speech recognition Robot localization User attention • Need to introduce time into our models • Basic approach: hidden Markov models (HMMs) • More general: dynamic Bayes’ nets
Stationary Distributions • If we simulate the chain long enough: What happens? Uncertainty accumulates Eventually, we have no idea what the state is! • Stationary distributions: For most chains, the distribution we end up in is independent of the initial distribution Called the stationary distribution of the chain Usually, can only predict a short time out
Viterbi Properties • Designed for computing the most likely state hidden sequence given a sequence of observations in Hidden Markov Models • Two passes, forward to compute the forward probabilities, and then backward to reconstruct the maximum sequence • What’s the time complexity? • O(d2n) - Why is this exciting? • There are many extensions to the basic Viterbi algorithm which have been developed for other models which have similar local structure: syntactic parsing, for instance.
HMMs for Continuous Obs.? • Before: discrete, finite set of observations • Now: spectral feature vectors are real-valued! • Solution 1: discretization • Solution 2: continuous emissions models Gaussians Multivariate Gaussians Mixtures of Multivariate Gaussians • A state is progressively: Context independent subphone (~3 per phone) Context dependent phone (=triphones) State-tying of CD phone
Conclusion • Perceptron A discriminative model, an alternative to generative models like Naïve Bayes Simple classification rule, based on a weight vector Simple online learning algorithm, guaranteed to converge if training set is separable • Hidden Markov Models A special kind of Bayesian Network designed for reasoning about sequences of hidden states Polynomial time inference for most likely state sequence (Viterbi) and marginalization (Forward- Backward) Many applications