BOOSTED MMI FOR MODEL AND FEATURE-SPACE DISCRIMINATIVE TRAINING

BOOSTED MMI FOR MODEL AND FEATURE-SPACE DISCRIMINATIVE TRAINING Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon and Karthik Visweswariah IBM T.J. Watson Research Center Yorktown Heights, NY, USA Presented by Yueng-Tien ,Lo

Outline • MMI review • BMMI • improvements to Model-Space updates • Experiments Results • conclusion

MMI review

BOOSTEDMMI • Enforce a soft margin that is proportional to the number of errors in a hypothesised sentence • Boost the likelihood of the sentences that have more errors thus generating more confusable data • b : boosting factor : accuracy of a sentence s given the reference Sr

IMPROVEMENTS TO MODEL-SPACE UPDATES(1/2) • Canceled statistics • cancel the statistics accumulated on each frame from the numerator and denominator • The only effect this canceling has is to change the Gaussian-specific learning-rate constant

IMPROVEMENTS TO MODEL-SPACE UPDATES(2/2) • I-smoothing to previous iteration • back off to the MMI estimate (based on one iteration of EBW starting from the current iteration’s statistics). • Rules for obtaining to be the largest of: ( I ) the typical case: ( I I )double the smallest • Lead to being positive definite] • I-smoothing to the previous iteration can be better than I-smoothing to the ML estimate

Experimental conditions

EBN50: MPE vs (B)MMI, 4th iter, =0.053 • 0.6% from canceling statistics • 0.9% from boosting • The baseline is MPE I-smoothed to MMI (MPE -> MMI).

EBN700 setup: MPE vs (B)MMI • BMMI-c->ML is better than MPE->MMI • 0.3% from boosting ,0.7% from canceling of statistics

EBN50: I-smoothing -> ML vs -> previous iteration(=0.053) • I-smoothing to the previous iteration rather than the ML estimate • Smoothing to the previous iteration is better, by 0.2% for MPE and 0.4% for BMMI-c

conclusion • We have Introduced… A modification to the MMI objective function - Error-boosting A modification to the MMI training procedure – the canceling of statistics • We have experimented with a totally MMI-based model-space discriminative training procedure and found that it sometimes but not always leads to substantial improvements over our previous MPE-based procedure

BOOSTED MMI FOR MODEL AND FEATURE-SPACE DISCRIMINATIVE TRAINING

BOOSTED MMI FOR MODEL AND FEATURE-SPACE DISCRIMINATIVE TRAINING

Presentation Transcript

Ch 5b: Discriminative Training (temporal model)

LECTURE 33: DISCRIMINATIVE TRAINING

Feature Model Merging Algorithms

LECTURE 32: DISCRIMINATIVE TRAINING

Discriminative Model Checking

Discriminative Feature Optimization for Speech Recognition

BOOSTED MMI FOR MODEL AND FEATURE-SPACE DISCRIMINATIVE TRAINING

MMI

A Discriminative Syntactic Word Order Model for MT

Discriminative Training Approaches for Continuous Speech Recognition

Large scale discriminative training for speech recognition

Support for Collaborative Feature Model Configuration

Minimum Phone Error (MPE) Model and Feature Training

A Discriminative Alignment Model for Abbreviation Recognition

LECTURE 31: DISCRIMINATIVE TRAINING

Survey ICASSP 2007 Discriminative Training

Paper space/Model space

Model Space and Paper Space

Feature space tansformation methods

Training of Boosted DecisionTrees