200 likes | 436 Views
A Voicing-Driven Packet Loss Recovery Algorithm for Analysis-by-Synthesis Predictive Speech Coders over Internet. Jhing-Fa Wang, Fellow, IEEE, Jia-Ching Wang, Jar-Ferr Wang, and Jian-Jia Wang IEEE Transaction on Multimedia, VOL. 3, NO. 1, March 2001. Outline. Introduction
E N D
A Voicing-Driven Packet Loss Recovery Algorithm for Analysis-by-Synthesis Predictive Speech Coders over Internet Jhing-Fa Wang, Fellow, IEEE, Jia-Ching Wang, Jar-Ferr Wang, and Jian-Jia Wang IEEE Transaction on Multimedia, VOL. 3, NO. 1, March 2001 Chin-Kai Wu, CS, NTHU
Outline • Introduction • Analysis-by-Synthesis Predictive Coders • Voicing-Driven Packet Loss Recovery Algorithm • Multiresolution Excitation Generation • Pulse Tracking Procedure • Experimental Results • Conclusion Chin-Kai Wu, CS, NTHU
Analysis-by-Synthesis Predictive Coders Codebook Index LPC parameter Gain aperiodic Total Excitation periodic Chin-Kai Wu, CS, NTHU
Voicing-Driven Packet Loss Recovery Algorithm nearest half previous frame Chin-Kai Wu, CS, NTHU
Multiresolution Excitation Generation • Reason to use wavelet transform • Good time-frequency localization • Multiresolution characteristics • Revealing some characteristics that other signal analysis techniques miss Two-stage two-band Wavelet analysis filter banks cj: scaling coefficient dj: wavelet coefficient h(n): Lowpass filter coefficient g(n): Highpass filter coefficient Chin-Kai Wu, CS, NTHU
voiced unvoiced Multiresolution Excitation Generation (Cont’d) Chin-Kai Wu, CS, NTHU
Multiresolution Excitation Generation (Cont’d) Chin-Kai Wu, CS, NTHU
Pulse Tracking Procedure • Pulse Position Tracking Algorithm • Pulse Amplitude Estimation Algorithm Chin-Kai Wu, CS, NTHU
Pulse Position Tracking Algorithm (Step 1) • Find the maximum absolute value within the frame and denote its position as P (peak position) tracking • Set all sample with opposite sign to zero Chin-Kai Wu, CS, NTHU
Pulse Position Tracking Algorithm (Step 2) • Construct a clipper by defining the clipping level from the maximum absolute value • Set samples below the clipping level to zero Chin-Kai Wu, CS, NTHU
Pulse Position Tracking Algorithm (Step 3) • Record peak position P and set a clearing region to reset all the samples within the region to zero Chin-Kai Wu, CS, NTHU
Pulse Position Tracking Algorithm (Step 4) • Repeat Step 1 to 3 to recorder all the recorded peaks according to the positions and denote them as {p1, p2, …, pNp} Chin-Kai Wu, CS, NTHU
Pulse Position Tracking Algorithm (Result) Chin-Kai Wu, CS, NTHU
Slope of where Pulse Amplitude Estimation Algorithm (Step 1) • Compute the trend slope associated with the envelope of the speech peaks Np: Number of peaks Chin-Kai Wu, CS, NTHU
Pulse Amplitude Estimation Algorithm (Step 2) • Estimate the pulse position of the missing frame. Let be the set of estimated pulses, where Chin-Kai Wu, CS, NTHU
Pulse Amplitude Estimation Algorithm (Step 3) • Estimate the envelope of the speech peaks in the missing frame • Track the amplitude of each excitation pulse Chin-Kai Wu, CS, NTHU
Pulse Amplitude Estimation Algorithm (Result) Chin-Kai Wu, CS, NTHU
Experimental Results Chin-Kai Wu, CS, NTHU
Experimental Results (Cont’d) Chin-Kai Wu, CS, NTHU
Conclusion • Packet loss degrades the speech quality of the analysis-by-synthesis coders seriously since the loss parameters not only affect the current speech frame but also produce error propagation problem • The proposed recovery algorithm estimates the excitation information in the missing frame more accurately due to the selection of different excitation generation models Chin-Kai Wu, CS, NTHU