g-SEDIK flow chart artis.jiang@gmail.com
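Speech enhancement flow
Recording file
Noise estimation (adaptive spatial filtering / adaptive beamforming)
Preprocessing: noise estimation + VAD
  MCRA (Minima Controlled Recursive Averaging)
  LTVAD
  MS (Minimum Statistics)
Speech enhancement / gain function (post filtering)
  Wiener filter: very sensitive to the noise estimate, less robust
  SS (Spectral Subtraction): more robust, with good results
  LSA (Log-Spectral Amplitude) (Maximum likelihood): poorer results
  OMLSA: best method
Enhanced file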
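The gain-function stage can be illustrated with a minimal per-bin sketch of two of the listed methods, spectral subtraction and the Wiener filter. The function names, the gain floor, and the use of the instantaneous SNR estimate are assumptions made for illustration; they are not taken from the g-SEDIK sources.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Hypothetical helpers, not part of the g-SEDIK code base.
// noisy_power: |Y(k)|^2 of one frequency bin of the noisy frame.
// noise_power: noise power estimate for the same bin (e.g. from MCRA or MS).
// gain_floor:  assumed lower bound on the gain, used to limit musical noise.

// Power spectral subtraction: G = sqrt(max(1 - N / |Y|^2, floor^2)).
double spectral_subtraction_gain(double noisy_power, double noise_power,
                                 double gain_floor = 0.1) {
    assert(noisy_power > 0.0 && noise_power >= 0.0);
    const double ratio = 1.0 - noise_power / noisy_power;
    return std::sqrt(std::max(ratio, gain_floor * gain_floor));
}

// Wiener gain: G = xi / (1 + xi), where the a priori SNR xi is approximated
// by the instantaneous estimate max(|Y|^2 / N - 1, 0).
double wiener_gain(double noisy_power, double noise_power,
                   double gain_floor = 0.1) {
    assert(noisy_power > 0.0 && noise_power > 0.0);
    const double xi = std::max(noisy_power / noise_power - 1.0, 0.0);
    return std::max(xi / (1.0 + xi), gain_floor);
}
```

Each gain would multiply the noisy spectrum of its bin before the IFFT. With this particular SNR estimate, the Wiener gain equals the square of the spectral-subtraction gain whenever both are above the floor, so it attenuates more strongly at low SNR.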
Flow chart: PESQ (legend: input, output, function(), data)
Blocks: Winsound, NB mic., Mic array [2], BF, VAD, recognizer, song_search(), MS SQL, display song title, Play Music Mp3/wav, Mp3 player
Files: %temp%\raw.pcm, %temp%\result.pcm
PESQ modules: pesqio (fileIO), pesqdsp (VAD+filter), dsp (FFT+math), pesqmain
Result: Show Score [0..5], Int PESQ_Score
Flow chart: Hmic_ASRDlg.cpp (legend: input, output, function(), data)
Utility functions:
  mic_scan(): stores the speech under test as a multidimensional array
  xplot(): stores the speech under test as a file
Data: Short int pSamples[2][128], Float ch_buffer[8][128], Waveform[i], result[]
Files in %temp%\: _sedik_ch_1.pcm, _sedik_ch_1.txt, _sedik_ch_2.txt, result.pcm
Blocks: Winsound, NB mic., Mic array [8], TF_GSC, BF, VAD, vad_power_mean(), Noise canceller, hamming, FFT, IFFT, Wave2Fea., recognizer, HSR_Start_Offline_Recognizer(), HSR_Recog_result(~), song_search(), MS SQL, Call Mp3 player, Play Music Mp3/wav, display song title
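The chart names a vad_power_mean() step operating on 128-sample frames but does not show its body. The following is a minimal sketch of a mean-power VAD decision under assumed conventions: the function names, the noise-floor argument, and the threshold ratio are hypothetical and are not the author's implementation.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: mean power of one frame of 16-bit PCM samples
// (the chart uses frames of 128 samples).
double frame_mean_power(const std::vector<std::int16_t>& frame) {
    assert(!frame.empty());
    double sum = 0.0;
    for (std::int16_t s : frame) {
        sum += static_cast<double>(s) * static_cast<double>(s);
    }
    return sum / static_cast<double>(frame.size());
}

// Hypothetical decision rule: a frame counts as speech when its mean power
// exceeds the noise-floor power by an assumed fixed ratio.
bool is_speech_frame(const std::vector<std::int16_t>& frame,
                     double noise_floor_power, double ratio = 4.0) {
    assert(noise_floor_power >= 0.0 && ratio > 0.0);
    return frame_mean_power(frame) > ratio * noise_floor_power;
}
```

In a pipeline like the one charted, the noise-floor power would come from the noise estimation stage, and frames flagged as non-speech could be used to update that estimate.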
Flow chart: Hmic_ASRDlg.cpp::tf_gsc_recog()
Data: ch_buffer[8], Waveform[i], result[], G[]
Files: R:\temp\, result.pcm, *.mdb access
Blocks: Mic array, Initial winsock, Recv data From MOD5270, Initial G[], TF_GSC, BF, FBF, BM, Produce MBF weightings by A, Produce BM by A, VAD, Noise canceller, hamming, FFT, IFFT, Wave2Fea., recognizer, HSR_Start_Offline_Recognizer(), HSR_Recog_result(~), MS SQL, Call Mp3 player, Play music, display song title
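The FBF and BM blocks can be illustrated with a simplified time-domain sketch. Note the substitution: this is the classical Griffiths-Jim GSC front end, which assumes the channels are already time-aligned toward the target, whereas a TF_GSC derives both blocks from estimated transfer-function ratios (the "by A" steps in the chart), which are not reproduced here. All type and function names are hypothetical; only the 8-channel, 128-sample frame shape follows the chart.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

constexpr std::size_t kChannels = 8;    // Mic array [8] in the chart
constexpr std::size_t kFrameLen = 128;  // frame length used by ch_buffer

using Frame = std::array<float, kFrameLen>;
using MultiChannelFrame = std::array<Frame, kChannels>;
using BlockedFrame = std::array<Frame, kChannels - 1>;

// Fixed beamformer (FBF): delay-and-sum with zero delays, i.e. the average
// over channels. Assumes the target signal is time-aligned across channels.
Frame fixed_beamformer(const MultiChannelFrame& ch) {
    Frame out{};
    for (std::size_t n = 0; n < kFrameLen; ++n) {
        float sum = 0.0f;
        for (std::size_t m = 0; m < kChannels; ++m) sum += ch[m][n];
        out[n] = sum / static_cast<float>(kChannels);
    }
    return out;
}

// Blocking matrix (BM): adjacent-channel differences. An aligned target
// cancels, leaving noise references for the adaptive noise canceller.
BlockedFrame blocking_matrix(const MultiChannelFrame& ch) {
    BlockedFrame out{};
    for (std::size_t m = 0; m + 1 < kChannels; ++m) {
        for (std::size_t n = 0; n < kFrameLen; ++n) {
            out[m][n] = ch[m][n] - ch[m + 1][n];
        }
    }
    return out;
}
```

A signal that is identical on all channels passes through fixed_beamformer() unchanged and yields all-zero outputs from blocking_matrix(); anything that differs between channels shows up in the blocked outputs, which is what the noise canceller would subtract from the FBF output.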
Mic Scan/Testing:: C:\Microphone_Testing\Direct_Out channel_1.txt channel_2.txt channel_8.txt
VAD:: C:\Microphone_Testing\Direct_Out channel_1.txt channel_2.txt channel_8.txt