1 / 16

Meeting 7

Meeting 7. Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November, 2005. Kalman Filter: Speech and noise tracking HNM Model: The degree of “ Harmonicity ” Bandwidth extension Future work: noise reduction using HNM. Contents.

Download Presentation

Meeting 7

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November, 2005

  2. Kalman Filter: Speech and noise tracking HNM Model: The degree of “Harmonicity” Bandwidth extension Future work: noise reduction using HNM Contents

  3. Previous method: Modelling speech with an AR model Kalman Filter: Speech and noise tracking X: NoisyS: SpeechD: Noise Noise Variance

  4. New method: Modelling speech AND noise with AR models Kalman Filter: Speech and noise tracking (cont.)

  5. The noise AR models are obtained from noise-only periods. Results from the new model sound more natural Clean Speech Noisy Speech Kalman Old Kalman New Kalman Filter: Speech and noise tracking (cont.)

  6. Harmonic sub-bands are modelled as the sum of a Gaussian and some random noise HNM Model R: Random Noise with Rayleigh Distribution

  7. Sample Reconstructed Frame HNM Model

  8. HNM Model Original Synthesized, PESQ:3.91

  9. HNM Model • Noise severely affects the Ak • Pitch, Harmonicity and Harmonic frequencies are much less distorted by noise • Simple analysis/synthesis of noisy speech improves its quality (SNR<10dB)

  10. Decompose the signal to an LP model (AR or LSF) and an HNM model of the residual (fk,Ak,Vk) Amplitude can be assumed to be equal (whitened by inverse modelling) Frequencies also may be assumed to be multiples of the fundamental frequency (later displaced slightly by LP modelling) LP-HNM LP-HNM synthesizedPESQ: 3.50

  11. One application of the model is Bandwidth Extension for getting 16KHz speech quality from 8KHz Speech Bandwidth Extension 8KHz Speech Signal 16KHz Speech Signal LP-HNM Analysis Trained LP-HNM Model LP-HNM Analysis

  12. Bandwidth Extension • Codebook Mapping is used to obtain higher LPF coefficients from lower LPF coefficients extracted from 8KHz signal • Similar method is used to obtain the harmonicity degree of higher sub-bands

  13. Bandwidth Extension • A shadow codebook for LP gain ratio (G8/G16) is used for gain mapping • Phase is extrapolated assuming a linear phase for the harmonics, some random noise is added to unvoiced sub-bands • The performance of the system deteriorates in noise

  14. Future Work • Tracking the HNM parameters using Kalman filter, in other words, rather than tracking DFT trajectories in one frequency bin, it might be better to track only the harmonic bins (reduced computational complexity) along the harmonic frequencies (intuitively makes more sense!)

  15. Future Work • Some harmonics proved very difficult to recover from noise (e.g. 1-3). Investigate the possibility of a similar model based approach as the BWE method for estimating parameters of those harmonics. The Harmonicity of the sub-bands and the reciprocal noise level at those frequencies may be used as weights in the mapping process. Clean Speech De-noised Speech

  16. Future Work • A is a parameter vector, W is the weighting vector (e.g. reciprocal of normalized noise spectrum). Bj is the jth entry of the codebook. • The result can be used for reconstructing speech

More Related