Component Score Weighting for GMM based Text-Independent Speaker Verification
Liang Lu (luliang07@gmail.com)
SNLP Unit, France Telecom R&D Beijing, 2008-01-21
Outline • Introduction • Conventional LLR and Motivation for detailed score processing • Component Score Weighting • Experimental Results • Conclusion
Introduction State of the art GMM-UBM framework • GMM based model construction • Log-likelihood Ratio (LLR) based decision making • Score normalisation (TNorm, HNorm, etc.) for robustness
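For concreteness, the following minimal Python sketch (not from the original slides) shows the baseline scoring path: a diagonal-covariance UBM fitted with scikit-learn and a frame-averaged LLR against a speaker model. In a real GMM-UBM system the speaker model would be MAP-adapted from the UBM and the score would then be normalised (e.g. TNorm); those steps are omitted here.

```python
# Minimal GMM-UBM scoring sketch (illustrative only, not the authors' code).
# Assumes acoustic features (e.g. MFCC frames) are already extracted as
# NumPy arrays of shape (n_frames, n_dims).
import numpy as np
from sklearn.mixture import GaussianMixture

def train_ubm(background_features, n_components=512):
    """Fit a diagonal-covariance universal background model on pooled data."""
    ubm = GaussianMixture(n_components=n_components, covariance_type="diag")
    ubm.fit(background_features)
    return ubm

def llr_score(test_features, speaker_gmm, ubm):
    """Conventional LLR: mean over frames of log p(x|spk) - log p(x|ubm)."""
    return np.mean(speaker_gmm.score_samples(test_features)
                   - ubm.score_samples(test_features))
```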
Introduction Major challenges • Limited data for speaker model training • Mismatch between training and testing data
Motivation for Component Score Weighting Motivation • The insufficiency of training data and the mismatch between training and testing conditions make the mixtures of a GMM differ in discriminative capability • The LLR simply sums the score of each mixture without considering its reliability • Would it help if the LLR took the discriminative capability of each mixture into account? Question: if it does, how do we exploit the discriminative capabilities of the Gaussian mixture components?
Component Score Weighting
Our method: first, scatter the frame-level LLR over the Gaussian mixtures:

$$\Lambda(x_t) = \log p(x_t \mid \lambda_{\mathrm{spk}}) - \log p(x_t \mid \lambda_{\mathrm{ubm}}) = \Lambda_d(x_t) + \Lambda_r(x_t)$$

where the $k^*$-th mixture is dominant for frame $x_t$, namely $k^* = \arg\max_k\, w_k\, p_k(x_t \mid \lambda_{\mathrm{spk}})$, and

$$\Lambda_d(x_t) = \log\!\big(w_{k^*}\, p_{k^*}(x_t \mid \lambda_{\mathrm{spk}})\big) - \log p(x_t \mid \lambda_{\mathrm{ubm}}), \qquad \Lambda_r(x_t) = \Lambda(x_t) - \Lambda_d(x_t).$$

We call $\Lambda_d(x_t)$ the dominant score and $\Lambda_r(x_t)$ the residual score.
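As an illustration of this decomposition (the function and variable names are mine, and recovering the per-component weighted likelihoods from scikit-learn responsibilities is an implementation convenience, not the authors' code), the dominant and residual scores can be computed per frame as follows.

```python
# Sketch of the dominant/residual split, continuing the previous snippet.
import numpy as np

def split_frame_llr(test_features, speaker_gmm, ubm):
    """Return per-frame dominant scores and residual scores."""
    log_spk = speaker_gmm.score_samples(test_features)   # log p(x_t | spk)
    log_ubm = ubm.score_samples(test_features)            # log p(x_t | ubm)
    # Per-component weighted log-likelihoods: log(gamma_k) + log p(x_t | spk)
    # equals log(w_k * p_k(x_t | spk)), since gamma_k = w_k p_k / p(x_t).
    log_wk_pk = (np.log(speaker_gmm.predict_proba(test_features) + 1e-300)
                 + log_spk[:, None])
    dominant = log_wk_pk.max(axis=1) - log_ubm            # dominant score
    residual = (log_spk - log_ubm) - dominant             # residual score
    return dominant, residual
```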
Component Score Weighting
Extend the original LLR: after this decomposition, the original LLR splits into two score series, the dominant score series and the residual score series.

Original: $\Lambda(X) = \frac{1}{T}\sum_{t=1}^{T}\big[\Lambda_d(x_t) + \Lambda_r(x_t)\big]$

If we consider the discriminative capacity of each Gaussian mixture:

Extended: $\tilde{\Lambda}(X) = \frac{1}{T}\sum_{t=1}^{T}\big[f(\Lambda_d(x_t))\,\Lambda_d(x_t) + g(\Lambda_r(x_t))\,\Lambda_r(x_t)\big]$

where $f(\cdot)$ and $g(\cdot)$ are weighting functions; $f = g = 1$ in the original LLR.
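In code form (same assumptions as the previous snippets, with the weighting functions passed in as ordinary Python callables), the extended score looks like this; with f = g = 1 it reduces to the original average LLR.

```python
import numpy as np

def extended_llr(dominant, residual, f=lambda s: 1.0, g=lambda s: 1.0):
    """Extended LLR: weight each frame's dominant and residual scores."""
    f_w = np.vectorize(f)(dominant)   # weights for dominant scores
    g_w = np.vectorize(g)(residual)   # weights for residual scores
    return np.mean(f_w * dominant + g_w * residual)
```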
Component Score Weighting Now the question is: how can we know the discriminative capability of each Gaussian mixture, and what should the weighting functions $f(\cdot)$ and $g(\cdot)$ be? Our assumption: we believe that high dominant scores have better discriminative capability and should be highlighted.
Component Score Weighting Why the high dominant scores? • If the test utterance is from the target speaker, more components in the GMM should score high relative to the UBM. • If the utterance is from an impostor, the high-valued components in the GMM hardly score higher than the UBM. • If the test utterance is from the target speaker, the low-valued components in the GMM arise because those mixtures are not well trained or because of mismatch between training and testing data.
Component Score Weighting
We simply use an exponential function as the weighting function, so that low dominant scores are restrained and high dominant scores are emphasized. The residual scores carry little importance, and we ultimately ignore them. The final LLR score is:

$$\tilde{\Lambda}(X) = \frac{1}{T}\sum_{t=1}^{T} f(\Lambda_d(x_t))\,\Lambda_d(x_t)$$

with $f(\cdot)$ an exponential weighting function.
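A hedged sketch of this final scoring step: the residual scores are dropped and each dominant frame score is re-weighted exponentially, so high scores are emphasized and low scores restrained. The specific form exp(alpha * s) and the single tuning constant alpha are assumptions for illustration, not necessarily the exact function used in the paper; TNorm would then be applied to the resulting score as in the baseline.

```python
import numpy as np

def weighted_llr(dominant_scores, alpha=1.0):
    """Exponentially weighted mean of per-frame dominant scores (residuals dropped)."""
    weights = np.exp(alpha * dominant_scores)   # assumed form: f(s) = exp(alpha * s)
    return np.mean(weights * dominant_scores)
```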
Experimental Results Table: results for the GMM baseline and GMM with Component Score Weighting, both with TNorm. Experiments are performed on the 1conv4w-1conv4w task of the 2006 NIST SRE corpus.
Conclusion • Splitting the LLR score and considering the discriminative capacity of the Gaussian mixtures helps cope with the insufficiency of training data and the mismatch between training and testing conditions. • The score weighting function should be consistent with the component score distribution and discriminative capacity. • The exponential weighting function used in this investigation is not universal and may not be optimal. More work is needed to find an optimal weighting function.