160 likes | 353 Views
Meeting 7. Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November, 2005. Kalman Filter: Speech and noise tracking HNM Model: The degree of “ Harmonicity ” Bandwidth extension Future work: noise reduction using HNM. Contents.
E N D
Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November, 2005
Kalman Filter: Speech and noise tracking HNM Model: The degree of “Harmonicity” Bandwidth extension Future work: noise reduction using HNM Contents
Previous method: Modelling speech with an AR model Kalman Filter: Speech and noise tracking X: NoisyS: SpeechD: Noise Noise Variance
New method: Modelling speech AND noise with AR models Kalman Filter: Speech and noise tracking (cont.)
The noise AR models are obtained from noise-only periods. Results from the new model sound more natural Clean Speech Noisy Speech Kalman Old Kalman New Kalman Filter: Speech and noise tracking (cont.)
Harmonic sub-bands are modelled as the sum of a Gaussian and some random noise HNM Model R: Random Noise with Rayleigh Distribution
Sample Reconstructed Frame HNM Model
HNM Model Original Synthesized, PESQ:3.91
HNM Model • Noise severely affects the Ak • Pitch, Harmonicity and Harmonic frequencies are much less distorted by noise • Simple analysis/synthesis of noisy speech improves its quality (SNR<10dB)
Decompose the signal to an LP model (AR or LSF) and an HNM model of the residual (fk,Ak,Vk) Amplitude can be assumed to be equal (whitened by inverse modelling) Frequencies also may be assumed to be multiples of the fundamental frequency (later displaced slightly by LP modelling) LP-HNM LP-HNM synthesizedPESQ: 3.50
One application of the model is Bandwidth Extension for getting 16KHz speech quality from 8KHz Speech Bandwidth Extension 8KHz Speech Signal 16KHz Speech Signal LP-HNM Analysis Trained LP-HNM Model LP-HNM Analysis
Bandwidth Extension • Codebook Mapping is used to obtain higher LPF coefficients from lower LPF coefficients extracted from 8KHz signal • Similar method is used to obtain the harmonicity degree of higher sub-bands
Bandwidth Extension • A shadow codebook for LP gain ratio (G8/G16) is used for gain mapping • Phase is extrapolated assuming a linear phase for the harmonics, some random noise is added to unvoiced sub-bands • The performance of the system deteriorates in noise
Future Work • Tracking the HNM parameters using Kalman filter, in other words, rather than tracking DFT trajectories in one frequency bin, it might be better to track only the harmonic bins (reduced computational complexity) along the harmonic frequencies (intuitively makes more sense!)
Future Work • Some harmonics proved very difficult to recover from noise (e.g. 1-3). Investigate the possibility of a similar model based approach as the BWE method for estimating parameters of those harmonics. The Harmonicity of the sub-bands and the reciprocal noise level at those frequencies may be used as weights in the mapping process. Clean Speech De-noised Speech
Future Work • A is a parameter vector, W is the weighting vector (e.g. reciprocal of normalized noise spectrum). Bj is the jth entry of the codebook. • The result can be used for reconstructing speech