Audio and Speech Processing Topic-3 Noise Reduction

1. Audio and Speech ProcessingTopic-3 Noise Reduction Marc Moonen/Ann Spriet Dept. E.E./ESAT, K.U.Leuven marc.moonen@esat.kuleuven.be homes.esat.kuleuven.be/~moonen/

2. Version 2010-2011 Topic 3: Noise Reduction Overview Spectral subtraction for single-micr. noise reduction Single-microphone noise reduction problem Spectral subtraction basics (=spectral filtering) Features: gain functions, implementation, musical noise,� Iterative Wiener filter based on speech modeling Multi-channel Wiener filter for multi-micr. noise red. Multi-microphone noise reduction problem Multi-channel Wiener filter (=spectral+spatial filtering) Kalman filter based on speech & noise modeling Kalman filters Kalman filters for noise reduction

3. Version 2010-2011 Topic 3: Noise Reduction Single-Microphone Noise Reduction Problem Microphone signal is Goal: Estimate s[k] based on y[k] Applications: Speech enhancement in conferencing, handsfree telephony, hearing aids, � Digital audio restoration Will consider speech applications: s[k] = speech signal

4. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction Methods: Basics Signal chopped into `frames� (e.g. 10..20msec), for each frame a frequency domain representation is (i-th frame) However, speech signal is an on/off signal, hence some frames have speech +noise, i.e. some frames have noise only, i.e. A speech detection algorithm is needed to distinguish between these 2 types of frames (based on energy/dynamic range/statistical properties,�)

5. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction Methods: Basics Definition: ?(?) = average amplitude of noise spectrum Assumption: noise characteristics change slowly, hence estimate ?(?) by (long-time) averaging over (M) noise-only frames Estimate clean speech spectrum Si(?) (for each frame), using corrupted speech spectrum Yi(?) (for each frame, i.e. short-time estimate) + estimated ?(?): based on `gain function�

6. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Gain Functions

7. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Gain Functions Example 1: Ephraim-Malah Suppression Rule (EMSR) with: This corresponds to a MMSE (*) estimation of the speech spectral amplitude |Si(?)| based on observation Yi(?) ( estimate equal to E{ |Si(?)| | Yi(?) } ) assuming Gaussian a priori distributions for Si(?) and Ni(?) [Ephraim & Malah 1984]. Similar formula for MMSE log-spectral amplitude estimation [Ephraim & Malah 1985]. (*) minimum mean squared error

8. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Gain Functions Example 2: Magnitude Subtraction Signal model: Estimation of clean speech spectrum: PS: half-wave rectification

9. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Gain Functions Example 3: Wiener Estimation Linear MMSE estimation: find linear filter Gi(?) to minimize MSE Solution: Assume speech s[k] and noise n[k] are uncorrelated, then... PS: half-wave rectification

10. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Implementation Short-time Fourier Transform (=uniform DFT-modulated analysis filter bank) = estimate for Y(?n ) at time i (i-th frame) N=number of frequency bins (channels) n=0..N-1 M=downsampling factor K=frame length h[k] = length-K analysis window (=prototype filter) frames with 50%...66% overlap (i.e. 2-, 3-fold oversampling, N=2M..3M) subband processing: synthesis bank: matched to analysis bank (see DSP-II)

11. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Musical Noise Audio demo: car noise Artifact: musical noise What? Short-time estimates of |Yi(?)| fluctuate randomly in noise-only frames, resulting in random gains Gi(?) statistical analysis shows that broadband noise is transformed into signal composed of short-lived tones with randomly distributed frequencies (=musical noise)

12. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Musical Noise Solutions? Magnitude averaging: replace Yi(?) in calculation of Gi(?) by a local average over frames EMSR (p7) augment Gi(?) with soft-decision VAD: Gi(?) ? P(H1 | Yi(?)). Gi(?) �

13. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Iterative Wiener Filter Basic: Wiener filtering based spectral subtraction (p.9), with (improved) spectra estimation based on parametric models Procedure: Estimate parameters of a speech model from noisy signal y[k] Using estimated speech parameters, perform noise reduction (e.g. Wiener estimation, p. 9) Re-estimate parameters of speech model from the speech signal estimate Iterate 2 & 3

14. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Iterative Wiener Filter

15. Version 2010-2011 Topic 3: Noise Reduction Spectral Subtraction: Iterative Wiener Filter For each frame (vector) y[m] (i=iteration nr.) Estimate and Construct Wiener Filter (p.9) with: estimated during noise-only periods 3. Filter speech frame y[m]

16. Version 2010-2011 Topic 3: Noise Reduction Overview Spectral subtraction for single-micr. noise reduction Single-microphone noise reduction problem Spectral subtraction basics (=spectral filtering) Features: gain functions, implementation, musical noise,� Iterative Wiener filter based on speech modeling Multi-channel Wiener filter for multi-micr. noise red. Multi-microphone noise reduction problem Multi-channel Wiener filter (=spectral+spatial filtering) Kalman filter based on speech & noise modeling Kalman filters Kalman filters for noise reduction

17. Version 2010-2011 Topic 3: Noise Reduction Multi-Microphone Noise Reduction Problem

18. Version 2010-2011 Topic 3: Noise Reduction Multi-Microphone Noise Reduction Problem

19. Version 2010-2011 Topic 3: Noise Reduction Multi-Microphone Noise Reduction Problem Data model: See Topic-2 on multi-path propagation, with q left out for conciseness. Hm(?) is complete transfer function from speech source position to m-the microphone

20. Version 2010-2011 Topic 3: Noise Reduction Multi-Channel Wiener Filter (MWF) Data model: Will use linear filters to obtain speech estimate (as in Topic-2) Wiener filter (=linear MMSE approach) Note that (unlike in DSP-II) `desired response� signal S1(w) is unknown here (!), hence solution will be `unusual��

21. Version 2010-2011 Topic 3: Noise Reduction Multi-Channel Wiener Filter (MWF) Wiener filter solution is (see DSP-II) All quantities can be computed ! Special case of this is single-channel Wiener filter formula on p.9

22. Version 2010-2011 Topic 3: Noise Reduction MWF combines spatial filtering (as in Topic-2) with single-channel spectral filtering (as in Topic-3/single-channel noise reduction) : if then� Multi-Channel Wiener Filter (MWF)

23. Version 2010-2011 Topic 3: Noise Reduction �then it can be shown that represents a spatial filtering (*) Compare to formulae for superdirective & delay-and-sum beamf. (Topic-2) Delay-and-sum beamf. maximizes array gain in white noise field Superdirective beamf. maximizes array gain in diffuse noise field MWF maximizes array gain in unknown (!) noise field. MWF is operated without invoking any prior knowledge (steering vector/noise field) ! (the secret is in the voice activity detection� (explain)) (*) Note that spatial filtering can improve SNR, spectral filtering never improves SNR (at one frequency) Multi-Channel Wiener Filter (MWF)

24. Version 2010-2011 Topic 3: Noise Reduction �then it can be shown that (continued) represents an additional `spectral post-filter� i.e. single-channel Wiener estimate (p.9), applied to output signal of spatial filter (prove it!) Multi-Channel Wiener Filter (MWF)

25. Version 2010-2011 Topic 3: Noise Reduction Multi-Channel Wiener Filter: Implementation

26. Version 2010-2011 Topic 3: Noise Reduction Solution is� Multi-Channel Wiener Filter: Implementation

27. Version 2010-2011 Topic 3: Noise Reduction Multi-Channel Wiener Filter: Implementation

28. Version 2010-2011 Topic 3: Noise Reduction SDW-MWF is MWF with additional tuning parameter: Design criterion for can be re-written as i.e. speech distortion+residual noise is minimized Speech Distortion Weighted MWF

29. Version 2010-2011 Topic 3: Noise Reduction Design criterion may now be modified to trade-off noise reduction against speech distortion: Then optimal solution is� i.e. (rather) straightforward modification By increasing `mu�, more noise is reduced, at the expense of more speech distortion (which is acceptable to a certain level) . means all emphasis is on noise reduction, speech distortion is ignored ( and then ! ) Speech Distortion Weighted MWF

30. Version 2010-2011 Topic 3: Noise Reduction Overview Spectral subtraction for single-micr. noise reduction Single-microphone noise reduction problem Spectral subtraction basics (=spectral filtering) Features: gain functions, implementation, musical noise,� Iterative Wiener filter based on speech modeling Multi-channel Wiener filter for multi-micr. noise red. Multi-microphone noise reduction problem Multi-channel Wiener filter (=spectral+spatial filtering) Kalman filter based on speech & noise modeling Kalman filters Kalman filters for noise reduction

31. Version 2010-2011 Topic 3: Noise Reduction Kalman Filter Given: state space model of a discrete-time MIMO system with v[k] and w[k]: mutually uncorrelated, zero mean, white noises Then: given A, B, C, D and input/output-observations u[k],y[k], k=1,2,... Kalman filter produces MMSE estimates of internal states x[k], k=1,2,... (=`Wiener filter for dynamic systems�)

32. Version 2010-2011 Topic 3: Noise Reduction Kalman Filter Definition: = MMSE-estimate of x[k] using all available data up until time l `FILTERING� = estimate `PREDICTION� = estimate `SMOOTHING� = estimate

33. Version 2010-2011 Topic 3: Noise Reduction Kalman Filter: filtering and 1-step prediction Given together with error covariance matrix P[k|k-1] Then obtain and using u[k], y[k] : Step 1: Measurement Update Step 2: Time Update

34. Version 2010-2011 Topic 3: Noise Reduction Kalman Filter: smoothing Estimate states x[1], x[2],�, x[N] based on data u[k], y[k], k = 1, 2, �N How? 1. forward run: apply previous equations for k = 1, 2, � N Result: estimates 2. backward run: apply following equations for k = N, N -1, �1 Result: (better) estimates

35. Version 2010-2011 Topic 3: Noise Reduction Kalman filter for Speech Enhancement Assume AR model of speech and noise Equivalent state-space model is� y=microphone signal

36. Version 2010-2011 Topic 3: Noise Reduction Kalman filter for Speech Enhancement with:

37. Version 2010-2011 Topic 3: Noise Reduction Kalman filter for Speech Enhancement PS: This was single-microphone case. How can this be extended to multi-microphone case ? Same A, x, v C=?

38. Version 2010-2011 Topic 3: Noise Reduction Kalman filter for Speech Enhancement

39. Version 2010-2011 Topic 3: Noise Reduction Kalman filter for Speech Enhancement iteration index time index (no iterations)

40. Version 2010-2011 Topic 3: Noise Reduction CONCLUSIONS Single-channel noise reduction Basic system is spectral subtraction Only spectral filtering, not easily extended to multi-channel case for additional spatial filtering Hence can only exploit differences in spectra between noise and speech signal: noise reduction at expense of speech distortion achievable noise reduction may be limited Multi-channel noise reduction Basic system is MWF, possibly extended with speech distortion regularization Provides spectral + spatial filtering (links with beamforming!) Kalman filtering alternative approach (though not easily applied in practice)

Audio and Speech Processing Topic-3 Noise Reduction

Audio and Speech Processing Topic-3 Noise Reduction

Presentation Transcript

Digital Audio Signal Processing Lecture-3: Noise Reduction

Speech and Audio Processing and Coding (cont.)

Audio and Speech Processing Topic 5: Acoustic Feedback Control

Speech and Audio Processing and Coding (cont.)

Speech and Audio Processing and Coding

Noise Processing

[Advanced] Speech & Audio Signal Processing

Digital Audio Signal Processing Topic-7: Active Noise Control & 3D Audio

Reduction of Additive Noise in the Digital Processing of Speech

Multi-microphone noise reduction and dereverberation techniques for speech applications

Speech Enhancement through Noise Reduction

Speech and Audio Coding

Speech and Audio Processing and Coding (cont.)

Reduction of Additive Noise in the Digital Processing of Speech

Noise Reduction

Noise Reduction in Speech Recognition

Speech & Audio Processing

Digital Audio Signal Processing Lecture-4: Noise Reduction

Noise Reduction

Noise reduction and addition in sensory-motor processing

Speech/Audio Signal Processing in MATLAB/Simulink

Speech & Audio Processing

Audio and Speech Processing Topic-3 Noise Reduction