RESYNCHRONIZATION OF THE ADAPTIVE CODEBOOK IN A CONSTRAINED CELP CODEC AFTER A FRAME ERASURE
Mohamed Chibani, Roch Lefebvre and Philippe Gournay
Université de Sherbrooke, Sherbrooke, Québec, Canada
Outline
• Basic CELP model
• Constrained optimization
• Resynchronization at the decoder
• Open-loop search of the shift (drift) of the ACB
• Closed-loop search of the shift
• Pitch contour modification
• Experimental results
• Conclusions
Prelude to the Resynchronization Algorithm
• After a frame erasure, both the waveform and the position of the pitch pulses in the ACB memory are erroneous.
• For voiced speech, the pitch pulse waveform evolves slowly.
• If the expected position of the last pitch pulse in the ACB memory can be determined, the ACB memory can be corrected.
• Due to the constraint, a good approximation of the pitch pulse can be obtained using only the parameters of the current frame.
The Excitation Signal Obtained After Setting the ACB Memory to Zero
Determination of the Expected Pitch Pulse Position in the Erroneous ACB Memory
[Figure: the excitation e0(n) and the correct excitation, annotated with the ACB delays, the pulse positions P(-1), P(0) and P(1), and the last pulse in the ACB memory.]
Estimation of the Shift
[Equation not recoverable from the slide text; the shift symbol carries the subscript 0.]
• P(0): the expected position of the last pitch pulse in the ACB memory
• P(-1): the actual position of the last pitch pulse in the ACB memory
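The open-loop estimate can be illustrated with a minimal sketch. Several points here are assumptions and are not stated on the slide: a pitch pulse is located as the largest-magnitude sample within one pitch period, P(0) is obtained by stepping back one ACB delay from the first pulse found in e0(n), and the shift is taken as P(0) minus P(-1). The function names and the sign convention are hypothetical.

```python
def pulse_position(signal, start, stop):
    """Index of the largest-magnitude sample in signal[start:stop] (assumed pulse detector)."""
    return max(range(start, stop), key=lambda n: abs(signal[n]))


def estimate_shift(acb_memory, e0, delay):
    """Open-loop shift estimate (hypothetical helper).

    Positions are relative to the start of the current frame, so positions
    inside the ACB memory are negative. Assumes delay <= len(acb_memory).
    """
    mem_len = len(acb_memory)
    # P(-1): actual position of the last pulse in the (erroneous) ACB memory.
    p_actual = pulse_position(acb_memory, mem_len - delay, mem_len) - mem_len
    # P(1): first pulse of the current frame, searched in e0(n) over one pitch period.
    p_first = pulse_position(e0, 0, min(delay, len(e0)))
    # P(0): expected position of the last pulse in the memory, one ACB delay earlier.
    p_expected = p_first - delay
    return p_expected - p_actual


# Toy example: the memory pulse sits 30 samples before the frame start,
# while e0(n) implies it should sit 25 samples before the frame start.
memory = [0.0] * 100
memory[70] = 1.0
e0 = [0.0] * 64
e0[15] = 1.0
print(estimate_shift(memory, e0, delay=40))  # 5
```

A real decoder would use a more robust pulse detector; the maximum-magnitude search only keeps the sketch short.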
Closed-loop Search for the Optimal Shift
[Equation not recoverable from the slide text.]
• e is the excitation signal built after correcting the ACB for every shift candidate
• L_FRM = 256
• L = max(2*L_SBFR, T(3))
• T(3) is the ACB delay of the 4th subframe
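The window length L can be computed directly from the slide; the search loop around it is only a sketch under assumptions. The selection criterion does not survive in the slide text, so the normalized correlation between e0(n) and each candidate excitation over the first L samples of the frame is used here purely as a stand-in. `build_excitation` is a hypothetical callback that returns the excitation built after correcting the ACB for a given shift, and L_SBFR is taken as a quarter of L_FRM (four subframes per frame, as in AMR-WB).

```python
import math

L_FRM = 256          # frame length, from the slide
L_SBFR = L_FRM // 4  # subframe length, assuming four subframes per frame


def window_length(t3):
    """L = max(2*L_SBFR, T(3)), with T(3) the ACB delay of the 4th subframe."""
    return max(2 * L_SBFR, t3)


def normalized_correlation(x, y):
    """Normalized cross-correlation of two equal-length sequences (0.0 if either is silent)."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0.0 else 0.0


def closed_loop_shift(e0, candidates, build_excitation, t3):
    """Return the candidate shift whose excitation best matches e0 over L samples.

    The matching criterion is an assumption, not the one from the slides.
    """
    length = min(window_length(t3), len(e0))
    best_shift, best_score = None, -math.inf
    for shift in candidates:
        e = build_excitation(shift)  # hypothetical: excitation after correcting the ACB
        score = normalized_correlation(e0[:length], e[:length])
        if score > best_score:
            best_shift, best_score = shift, score
    return best_shift
```

In practice the candidates would be a small range of shifts around the open-loop estimate, which keeps the number of excitation reconstructions low.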
Example of a Resynchronized Excitation
[Figure: four panels: the correct excitation; the excitation e0(n); the excitation signal built using the erroneous ACB memory; the excitation signal built after correcting the ACB memory.]
Modification of the Pitch Contour After the Resynchronization
[Figure: three panels: the correct excitation; the excitation after the resynchronization; the excitation after the modification of the pitch contour.]
The shift of each interval carries the index i (its symbol is lost in the slide text); Np is the number of pitch periods.
The Effect of the Resynchronization Algorithm When Applied to a Voiced Speech Segment
[Figure: four panels: error-free signal; standard codec; constrained codec; constrained codec with resynchronization.]
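Experimental Results
Test features:
• AMR-WB at mode 2 (12.65 kb/s)
• 10 listeners
• 14 pairs of sentences for each condition
• Listening using binaural headphones
[Figure: bar chart of MUSHRA score (axis from 0 to 90) versus frame erasure rate (0%, 5%, 10%) for three conditions: standard codec, constrained codec, constrained codec with resynchronization. Value labels in extraction order: 73.67, 73.67, 74.88, 53.90, 49.18, 42.36, 40.40, 36.70, 29.99; the assignment of values to conditions is not recoverable from the text.]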
Conclusions
• Resynchronization speeds up the recovery of the decoder after a frame erasure.
• The method (constraint + resynchronization) needs neither extra bits nor extra delay.
• The modified codec is completely interoperable with the standard (the bitstream is not modified).
• Only 10 to 15% of the frames following an erased frame are resynchronized.
• The only drawback is a minor loss of quality in error-free channels.