7- Speech Quality Assessment

7-Speech Quality Assessment • Quality Levels • Subjective Tests • Objective Tests • Intelligibility • Naturalness

Quality Levels • Synthetic Quality (Under 4.8 kbps) • Communication Quality (4.8 to 13 kbps) • Toll Quality (13 to 64 kbps) • Broadcast Quality (Upper than 64 kbps)

Test Types

First ClassSubjective Intelligibility Tests • Diagnostic Rhyme Test (DRT) • Selecting between two CVC by different first C • First C should have specific properties • Ex. hop - fop And than - dan • Modified Rhyme Test (MRT) • Selecting between CVC’s by different first C • Ex. Cat, bat, rat, mat, fat, sat

First Class (Cont’d)Subjective Intelligibility tests • DRT is very applicable and credible • In this test user can hear the speech only once

Second ClassSubjective Naturalness tests • Mean Opinion Score (MOS) • MOS is very applicable and credible • In this test user can hear the speech a lot • Diagnostic Acceptability Measure (DAM) • This test is very complex

Mean Opinion Score (MOS) • Scores for MOS are like this Score Speech Quality 1 2 3 4 5 Not Acceptable Weak Medium Good Excellent

Diagnostic Acceptability Measure (DAM) • This test is very complex • In this test there is 19 different parameters for score. These parameters divide into 3 main groups: • Signal Quality • Background Quality • Total Quality

Objective Tests • These tests can not be used for intelligibility. Because system couldn’t recognize speech intelligibility • Objective tests can only be used for speech Naturalness

Objective Tests (Cont’d) • Articulation Index (AI) • Signal to Noise Ratio (SNR) • Global (Classic) SNR • Segmental SNR • Frequency Weighted Segmental SNR

Articulation Index (AI) • AI assumes that different frequency bands distortion are independent, and measure signal quality in different bands. • In each band determines percentage of perceptible signal by listener 20 Bands HZ . . . . . . . . . 200 6100

Articulation index (Cont’d) • Perceptible by user signal : • 1- Upper than human hearing threshold • 2- Under than human pain threshold • 3- Upper than Masking Noise level • In each case one of the states 1 or 3 is prevail

Articulation index (Cont’d) • In AI SNR measured isolated in each band

Signal To Noise Ratio(SNR)

Segmental SNR j’th Frame SNR M : Number of frames

Frequency Weighted Segmental SNR K : Number of frequency bands M : Number of frames

Deller Formula

Other Formulas:

Itakura Measure Is the envelope spectrum Use from All-Pole (AR) Model

Itakura Measure (Cont’d) This is based on the spectrum difference between main signal and assessment signal Autoregressive Coefficients Reflection Coefficients Autocorrelation Coefficients

Itakura Measure (Cont’d) m :Index of frame l : Index of coefficients

Itakura Measure (Cont’d) Is the l’th parameter of the frame that conduces m’th sample

Weighted Spectral Slope Measure(WSSM) Is STFT of k’th band of the frame that conduces m’th sample

7- Speech Quality Assessment

7- Speech Quality Assessment

Presentation Transcript

ASSESSMENT OF SPEECH SOUND DISORDERS

6-Speech Quality Assessment

External Quality Assessment

Chapter 7 SPEECH COMMUNICATIONS

Image Quality Assessment

Year 9 Speech Assessment

Quality assessment

Quality Assessment

QUALITY ASSESSMENT PRACTICES

DOIS Quality Assessment

Film Quality Assessment

Quality Assessment

Quality Assessment

7- Speech Recognition (Cont’d)

Speech Quality Measurement Systems

7- Speech Recognition

6- Speech Quality Assessment

WATER QUALITY ASSESSMENT

7- Speech Recognition (Cont’d)

Library Quality Assessment