Special Project (3): Viterbi Decoding and Triphone Acoustic Model

Prof. Lin-Shan Lee, TA: Yun-Chiao Li

Viterbi Decoding: 03.04.mono0a.viterbi.sh, 04.04.tri1.viterbi.sh

Viterbi Decoding
• Instead of the WFST decoder, we now use Viterbi decoding.
• The Kaldi acoustic model has been converted to HTK format by Vulcan.
• (02.02.convert.htk.feat.sh) Converts the acoustic model from Kaldi to HTK.
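
As a reminder of what Viterbi decoding computes, here is a minimal sketch on a toy HMM. It is not the HTK decoder used by the scripts; the function name `viterbi` and the list-based inputs (`log_init`, `log_trans`, `log_emit`) are assumptions made for illustration only. All scores are log probabilities, so products become sums.

```python
def viterbi(log_init, log_trans, log_emit):
    """Return (best_log_score, best_state_path) for a toy HMM.

    log_init[s]     : log P(first state is s)
    log_trans[p][s] : log P(next state is s | current state is p)
    log_emit[t][s]  : log P(observation at frame t | state s)
    (Illustrative data layout, not an HTK or Kaldi format.)
    """
    n_states = len(log_init)
    # Best score of any path ending in each state at frame 0.
    score = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    backptr = []  # backptr[t-1][s] = best predecessor of state s at frame t

    for t in range(1, len(log_emit)):
        new_score, ptr = [], []
        for s in range(n_states):
            # Keep only the best-scoring predecessor for state s.
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            new_score.append(score[best_prev] + log_trans[best_prev][s]
                             + log_emit[t][s])
            ptr.append(best_prev)
        score = new_score
        backptr.append(ptr)

    # Trace back from the best final state.
    last = max(range(n_states), key=lambda s: score[s])
    path = [last]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    path.reverse()
    return score[last], path
```

Only one predecessor per state is kept at each frame, which is why the cost grows linearly with the number of frames rather than exponentially.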

Viterbi Decoding
• Use the dev set to find the best acoustic weight (acwt).
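
The search for acwt can be sketched as a small grid search. This is a simplified illustration, not what the course scripts do internally: it assumes the weight scales the acoustic log-likelihood before it is added to the language model score, and it rescores a fixed list of candidate hypotheses instead of re-running the decoder. All function names and the `dev_set` layout are hypothetical.

```python
def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two word lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def pick_hypothesis(nbest, acwt):
    """nbest: list of (words, acoustic_loglik, lm_logprob) tuples (assumed layout)."""
    return max(nbest, key=lambda h: acwt * h[1] + h[2])[0]


def dev_error_rate(dev_set, acwt):
    """dev_set: list of (reference_words, nbest) pairs (assumed layout)."""
    errors = sum(edit_distance(ref, pick_hypothesis(nbest, acwt))
                 for ref, nbest in dev_set)
    words = sum(len(ref) for ref, _ in dev_set)
    return errors / words


def best_acwt(dev_set, candidates):
    """Return the candidate weight with the lowest error rate on the dev set."""
    return min(candidates, key=lambda w: dev_error_rate(dev_set, w))
```

The weight is chosen on the dev set rather than the test set so that the reported test accuracy is not tuned on the data it is measured on.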

Triphone Acoustic Model: 04.01 to 04.04

Triphone Acoustic Model
• In a monophone acoustic model, each phone (ㄅ, ㄆ, ㄇ) has its own model.
• In a triphone acoustic model, each phone together with its left and right neighbors (for example ㄅ-ㄆ-ㄇ) is one model.
• This gives too many models and too little training data for each of them.
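
To make the "too many models" point concrete, the sketch below expands a phone sequence into triphone labels and counts the possible contexts. The `left-center+right` label format follows the HTK naming convention; the `sil` padding at the utterance edges and the phone inventory size used in the usage note are assumptions for illustration, not values taken from the course data.

```python
def to_triphones(phones, boundary="sil"):
    """Expand a phone sequence into left-center+right triphone labels.

    The boundary symbol used as context at both ends is an assumption.
    """
    padded = [boundary] + list(phones) + [boundary]
    return [f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
            for i in range(1, len(padded) - 1)]


def num_possible_triphones(num_phones):
    """Upper bound on distinct triphones: any left, center and right phone."""
    return num_phones ** 3
```

For example, `to_triphones(["ㄅ", "ㄆ", "ㄇ"])` gives `["sil-ㄅ+ㄆ", "ㄅ-ㄆ+ㄇ", "ㄆ-ㄇ+sil"]`, and a hypothetical inventory of 40 phones allows up to 64000 triphones, far more than a typical training set covers.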

Decision Tree
• Use a decision tree to tie similar models together.
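
The idea of tying can be illustrated with a toy grouping. This is only a sketch: a real decision tree picks its questions from the training data (for example by likelihood gain), whereas the questions here are fixed by hand. The function names, the `left-center+right` label format, and the example phone sets are all assumptions.

```python
from collections import defaultdict


def parse_triphone(label):
    """Split a 'left-center+right' label into its three phones."""
    left, rest = label.split("-", 1)
    center, right = rest.split("+", 1)
    return left, center, right


def tie_by_questions(triphones, questions):
    """Group triphones that share a center phone and answer every question alike.

    questions: list of (side, phone_set) pairs, where side is "L" or "R" and
    the question asks "is the phone on that side in phone_set?".
    """
    tied = defaultdict(list)
    for label in triphones:
        left, center, right = parse_triphone(label)
        answers = tuple((left if side == "L" else right) in phone_set
                        for side, phone_set in questions)
        tied[(center, answers)].append(label)
    return dict(tied)
```

Triphones that land in the same group would share one set of parameters, so rarely seen contexts borrow training data from similar, more frequent ones.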

04.01.tri1.train.sh (1/3)
• It is very similar to 03.01.

04.01.tri1.train.sh (2/3)

04.01.tri1.train.sh (3/3)

Homework
bash 04.01.tri1.train.sh
bash 04.02.tri1.mkgraph.sh
bash 04.03.tri1.fst.sh
bash 04.04.tri1.viterbi.sh

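Some Helpful References
• "使用加權有限狀態轉換器的基於混合詞與次詞以文字及語音指令偵測口語詞彙" (Spoken term detection with text and spoken queries based on hybrid word and subword units using weighted finite-state transducers), Chapter 3
• https://www.dropbox.com/s/dsaqh6xa9dp3dzw/wfst_thesis.pdf
• See HDecode and HLRescore in the HTK Book.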
