Speed-up Facilities in s3.3

Presentation Transcript


  1. Speed-up Facilities in s3.3
     • Search
       • Lexicon Structure: Tree
       • Pruning: Standard
       • Heuristic Search Speed-up: Not implemented
     • GMM Computation
       • Frame-Level: Not implemented
       • Senone-Level: Not implemented
       • Gaussian-Level: SVQ-based GMM Selection (number of sub-vectors constrained to 3)
       • Component-Level: SVQ code removed

  2. Summary of Speed-up Facilities in s3.4
     • Search
       • Lexicon Structure: Tree
       • Pruning: (New) Improved Word-end Pruning
       • Heuristic Search Speed-up: (New) Phoneme Look-ahead
     • GMM Computation
       • Frame-Level: (New) Naïve Down-Sampling; (New) Conditional Down-Sampling
       • Senone-Level: (New) CI-based GMM Selection
       • Gaussian-Level: (New) VQ-based GMM Selection; (New) Unconstrained no. of sub-vectors in SVQ-based GMM Selection
       • Component-Level: (New) SVQ code enabled
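
The slide names two of the GMM-computation speed-ups without describing them. The sketch below illustrates the general ideas as they are commonly understood, not the s3.4 code itself: naïve down-sampling evaluates the GMMs only on every n-th frame and reuses the latest scores in between, and CI-based GMM selection evaluates a context-dependent (CD) senone only when its context-independent (CI) parent scores within a beam of the best CI senone, otherwise backing off to the CI score. All function and parameter names here are hypothetical, and scores are assumed to be log-likelihoods (higher is better).

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]
Scores = List[float]


def score_with_naive_downsampling(
    frames: Sequence[Frame],
    score_frame: Callable[[Frame], Scores],
    ratio: int = 2,
) -> List[Scores]:
    """Run the full GMM evaluation on every `ratio`-th frame only.

    Skipped frames reuse a copy of the most recently computed scores.
    `score_frame` is a hypothetical stand-in for the full senone scorer.
    """
    if ratio < 1:
        raise ValueError("ratio must be >= 1")
    all_scores: List[Scores] = []
    last: Scores = []
    for t, frame in enumerate(frames):
        if t % ratio == 0:
            last = score_frame(frame)  # full evaluation on this frame
        all_scores.append(list(last))  # skipped frames copy the latest scores
    return all_scores


def score_with_ci_selection(
    ci_scores: Sequence[float],
    cd_to_ci: Sequence[int],
    score_cd: Callable[[int], float],
    beam: float,
) -> Scores:
    """Evaluate a CD senone only if its CI parent is within `beam` of the best.

    `cd_to_ci[i]` is the index of the CI senone that CD senone `i` maps to.
    CD senones whose parent falls outside the beam back off to the CI score.
    """
    threshold = max(ci_scores) - beam
    out: Scores = []
    for cd, ci in enumerate(cd_to_ci):
        if ci_scores[ci] >= threshold:
            out.append(score_cd(cd))    # parent is promising: full CD score
        else:
            out.append(ci_scores[ci])   # parent is unlikely: reuse CI score
    return out
```

With `ratio=2` the scorer runs on roughly half the frames. The beam in the second function trades accuracy for speed: a wider beam evaluates more CD senones.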

  3. Near Term Improvement of Decoder
     • Improve LM facilities (Avail at Mar 31)
     • Improve speed-up techniques (Avail at Mar 31)
     • Complete phoneme look-ahead research
     • Complete machine optimization in Intel platform
     • Enable speed-up in live-mode recognition (Avail at Mar 31)
     • Improved search structure
     • Modify code to use lexical tree copies (Apr 15)
     • Modify code to handle cross-word triphones (Apr 30)

  4. Training Plan
     • Text-Processing (Avail at Mar 31)
     • First Pass of Acoustic/Language Modeling (Avail at Apr 15)
     • With the help of the new 4-CPU machine
     • Training using standard recipe
     • CD + CI mode first-pass models
     • Trigram models
     • Second Pass of Acoustic/Language Modeling
     • Improved training
     • Decide what we should do after we get the results
     • AM/LM Adaptation? (Don’t know yet)
