Predicting Speech Intelligibility

Predicting Speech Intelligibility

Where we were… • Model ofspeechintelligibility • Goodpredictionof Greenberg’sbands Data

Greenberg Bands • Timit Sentences filtered into four narrow spectral channels • Task identifyspeech codedin various channel combinations

Greenberg’s data • High levels ofintelligibilityfor reducedrepresentation • Intelligibility isnot sum of partsch 2 : 9ch 3 : +9ch23: =60

Hearing threshold

Matching Perceptual Data • R2=0.987 for bands data • Model • Filterbank (32 channels ERB) • Modulation filter (Fc=1kHz) • Mutual information in modulation map/spectrum space • But how well does it compare

SII • ASA Working Group S3-79,in charge of reviewing ANSIS3.5-1997 (“Methods for Calculation of the Speech Intelligibility Index”).http://www.sii.to/ • SII computes intelligibility • Speech Spectrum Level • Equivalent Noise Spectrum Level • Equivalent Hearing Threshold Level [dBHL] • Band Importance function

Predictions for: • Average speech • various nonsense syllable tests where most English phonemes occur equally often • CID-22 • NU6 • Diagnostic Rhyme test • short passages of easy reading material • SPIN

SII predictions for Bands • Hannes Muesch: SII is not designed to work for narrow spectral bands – and it doesn’t… • Bands 1234: prediction 25% ; reality 88%

Reasons for SII failure • SII is a glorified lookup table • computes weighted contribution of individual channels, assumption ‘broad bands’ • Adjacent auditory channels are highly correlated • A contiguous band of 4 channels is • “one information bearing channel”, plus • “three channels with little extra information

How does SII compare • Our algorithm computes ‘information’ not intelligibility • Expect vocabulary size, word type … to make a difference

SII fit

Conclusion • MI based measure marginally better than SII if treated equally,BUT • SII is based on lookup tables, with only small model components (masking, thold) • MI measure is an algorithmic solution • Key Question • Does MI solution generalise? • How to deal with wide bands?

Generalisation • Currently running series of experiments using BKB data • white noise • Greenberg slits

Dealing with correlation • Need to compute the ‘added information’ that extra channels contribute to existing channels – could do with principled solution here

Predicting Speech Intelligibility

Predicting Speech Intelligibility

Presentation Transcript

Enhanced energy redistribution speech intelligibility algorithm with real-time implementation

Room Acoustics Speech Intelligibility STI

The Relation Between Speech Intelligibility and The Complex Modulation Spectrum Steven Greenberg International Computer

Assessing Speech Intelligibility and Severity

Predicting Homicides

New Technique for Improving Speech Intelligibility for the Hearing Impaired

Predicting Weather

Predicting

Predicting Heights

PREDICTING

Predicting Products

Intelligibility of Speech Produced by Young Children with Cochlear Implants

Predicting

Prosody and Intelligibility of Speech in Children with Cochlear Implants

Predicting eclipses

Predicting Products

Predicting Performance

Predicting

Why an objective intelligibility assessment ?

PREDICTING PREDICTABILITY

Lecture 1.2. THEORY OF SPEECH INTELLIGIBILITY