專題進度報告

專題進度報告 資工四 B93902054 洪志豪資工四 B93902076 林宜鴻

Flow chart

Language Model • 1.Normalize(UTF-8) -> normalize_utf8 • 2.From UTF-8 to Big5 -> iconv • 3.parse according to dictory -> textSegment • 4.Build language model -> ngram-count

Acoustic Model - HERest • 1.meaning：Adjust HMM model • 2.command: HERest –C herest.cfg –S train.scp –L /share/data/TrainingLabel –H hmmset.mmf rcd.lst • 3.Run more can get higher accuracy

Combine Acoustic and Language Model(1) pre-requirement • 1.change the bigram.lm from Big5 to ascii • 2.command： /share/program/big52ascii/chenc bigram_Big5.lm > bigram_ascii.lm • 3.replace <s> with !=ENTER </s>with !=EXIT A useful command to replace: ->:1,$ s/<s>/!ENTER/g :1,$ s/<\/s>/!EXIT/g

Combine Acoustic and Language Model(2)HBuild • 1.meaning：change the SRILM form to HTK network form • 2.command： HBuild –n bigram_ascii.lm htkword.dic wd_net • 3.htkword.dic is the relation between work and pronunciation EX:[B7C2] k_e e • 4.wd_net is that word_net that we need EX:I=9996 W=[A64C][C3D2] J=570711 S=9551 E=2135 l=-7.52

Combine Acoustic and Language Model(3) HBuild(Cond.) - Questions • htkword.dic has no word that corresponds to <unk> <sol>: don’t add flag –u when training language model, but is it appropriate?

Decode-HVite(1) • 1.meaning：decode testing corpus • 2.command： HVite –t 250 –H hmmset.mmf –S testing.scp –C hvite.cfg –w wd_net -i result.mlf –p 0.0 –s 0.0 htkword.dic rcd.lst • 3.result is in result.mlf EX：0 1100000 !ENTER -1042.10318 1100000 2200000 [B37D] -734.694397

Decode-HVite(2) - Questions • If we use HERest to train hmmset.mmf with phone.lst，it will error： ->cannot load file n_o <sol>:use rcd.lst instead

Divide Mixtures by HHEd • But hmmset.mmf given by TA already has 32 mixtures for each state • What config to use? Old DSP hw: the same as what HERest use My trail: it can run without config

Initailize Acoustic Model • 2 methods for 2 different conditions With labels: use HInit and HRest Without labels: use HCompV • Both of them has a similar question

Reference • 1.HTK_Book.pdf • 2.SpeechProject2007Winter.ppt • 3.dsp_hw1.ppt(2006_spring_DSP) • 4.HTK_Training.pdf • 5.SpeechProj board in ptt2

專題進度報告

專題進度報告

Presentation Transcript