Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC

Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 28 January, 2004

Contents The effect of noise on LP-Model Poles Formant Extraction using LP-Model of speech Recognition: Formants Vs. MFCC Features The effect of Maximum-Normalization and Mean Subtraction Static Vs. Dynamic Features

Histogram of Pole Frequencies for Different phonemesMale Speaker – Train Noise SNR = 0

Signal Pre-Processing Save the Formants, move to the next segment and repeat the procedure until the end of signal is reached. Windowing Yes Do poles meet Conditions? LP-Modelling and LP-Pole Extraction no Increase LP Order Formant Extraction Using LP-Model Poles • Maximum BW of formants • Limited frequency range • Fixed number of formants • Candidate Sets • Distant measure • Procedure in consonants

Using LP Formants as features for recognition • In addition to the Frequency of poles their Band widths and Magnitudes are used as well • The HMM models are trained on mono-phones.

Recognition ResultsFormants Vs. MFCC • MFCC Features contain C0 , Delta and Delta-Delta Features • Appended Features are vectors of MFCC appended to formants (length=75)

In Maximum Normalizing each row is divided by the maximum absolute value of that particular row. In Mean Subtraction the mean of each row is subtracted so that the mean of each row will be set to zero. Combining these two, first the features are mean subtracted, then maximum normalized. Maximum Normalizing and Mean Subtracting the features

Recognition ResultsMFCC Vs. Mean Subtracted Max Normalized MFCC With C0 • C0 is badly affected by noise.

Recognition ResultsMFCC Vs. Mean Subtracted Max Normalized MFCCWithout C0 • The effect of noise on C0 can be compensated to some extents by Normalizing the features

Recognition ResultsFormants Vs. Mean Subtracted Max Normalized Formants • Normalization increases the Recognition rate 10% in noisy conditions

MFCC - Dynamic Vs. ‘Static’ Features Dynamic Values are Delta and Acceleration Values ‘Static’ Values are the Actual Values

Formants - Dynamic Vs. ‘Static’ Features Dynamic Values are Delta and Acceleration Values ‘Static’ Values are the Actual Values

Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC

Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC

Presentation Transcript

Synthesis of Noise Effects on Wildlife Populations

The Effects of Alcohol and Tobacco on the Heart Rate of Daphnia magna

D2a Analysis of the Effects of Exercise on the Musculoskeletal System

Recognition and Treatment of HCT Late Effects

Experiments on Noise Analysis

Effects of Temperature on Rate of Cellular Respiration

The effects of microgravity on the fermentation of honey using yeast

The effects of Alcohol on the pulse rate of the Lumbriculus Variegatus

Effects of Metrical Subdivision on Perceived Beat Rate

Angular Spectrum Analysis of Ocean Noise Using Discrete Noise Source suppression

The Effects of Caffeine on the Heart Rate of Daphnia magna

The Effects of Different Drinks on Heart Rate During Exercise

The Effects of Restriction of Recognition on False Memory

Analysis of noise runs

The Weird Noise On The Train

The nature of X(3900) and recognition of open charm effects

Effects on Outcomes of Heart Rate Reduction

Analysis of Effects of Train/Car noise in Formant Track Estimation

Image recognition using analysis of the frequency domain features

The Effects of Caffeine on the Heart Rate of Daphnia magna

Analysis of quantum noise

Protect Your Ears From The Effects Of Noise