1 / 14

Refinement in FTLP-HNM system for Speech Enhancement

Refinement in FTLP-HNM system for Speech Enhancement. Qin Yan Communication & Multimedia Signal Processing Group School of Engineering and Design, Brunel University 23 Nov, 2005. Review of FTLP-HNM system; Parameters estimation of HNM (incl. pitch/harmonic tracking in noise)

patia
Download Presentation

Refinement in FTLP-HNM system for Speech Enhancement

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing Group School of Engineering and Design, Brunel University 23 Nov, 2005

  2. Review of FTLP-HNM system; Parameters estimation of HNM (incl. pitch/harmonic tracking in noise) Objective results of pitch, harmonic tracking and FTLP-HNM system Demo of enhanced speeches from old archive recordings Outline

  3. Overview of FTLP-HNM Speech Enhancement System

  4. Harmonic plus Noise Model In HNM, speech is decomposed to two parts : Harmonic part and noise part. Harmonic : where L(t) denotes the number of harmonic included in the harmonic part, ω0 denotes the pitch frequency. Noise : where h the a time-varying autoregressive(AR) model and b is white Gaussian noise. Synthesized Speech :

  5. HNM - Pitch Tracking • Error function in frequency domain • In noisy condition the error function is modified to including SNR dependent weights The weighting function W(l) is a SNR-dependent given by • NOTE: • The input speech frame is bandpassed to eliminated the parts which don’t contain explicit harmonics. • For Each speech frame, it outputs several pitch candidates (N=3) and Viterbi algorithm then generates the final pitch tracks. • It might be useful to have candidates from this method and traditional autocorrelation method.

  6. Results of Pitch Tracking Figure - Comparison of the performance of different pitch track methods for speech in (a) train noise (b) car noise from 0dB SNR to clean.

  7. HNM - Harmonic Tracking Noise model VAD Smoothed Harmonic Magnitude by Kalman filter Pitch Tracking Noise Speech Harmonic Track Candidates Harmonic Frequency bin tracks Peak picking Tracking FFT • Data structure of harmonic track candidates are improved and speed up the whole system.

  8. Results of Harmonic Tracking in Clean Speech Figure - An illustration of pitch tracks of a speech segment at sampling frequency of 8kHz.

  9. Harmonic Recovery Pitch recovery Results of Harmonic Tracking in Noisy Speech

  10. Synthesis of Excitation by HNM Voiced Excitation : Unvoiced Excitation : Where b(m) is unit white Gaussian noise , e(m) is original excitation and a is the phases of original excitation.

  11. Results of Speech Enhancement Enhanced speech is synthesized by inverse filtering the HNM residual with cleaned LP shape. Figure - Comparison of the harmonicity of MMSE and FTLP-HNM systems on train noisy speech at different SNRs Figure - Performance of MMSE and FTLP-HNM on train noisy speech at different SNR levels.

  12. Demo (1) Persian speech for Iranian King Mozaffareddin Shah Original speech Enhanced speech

  13. Demo (2) Florence Nightinggale 1890 Original speech Enhanced speech

More Related