1 / 25

ITU (International Telecommunication Union) ITU-T (Telecommunication standardization sector) Study Group 12 (Perform

ITU (International Telecommunication Union) ITU-T (Telecommunication standardization sector) Study Group 12 (Performance, QoS and QoE) Overview of Speech Activities in ITU-T Study Group 12 QoMEX’10, Trondheim, Norway Sebastian Möller Co-Rapporteur Q.8/12. Overview.

iden
Download Presentation

ITU (International Telecommunication Union) ITU-T (Telecommunication standardization sector) Study Group 12 (Perform

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. ITU (International Telecommunication Union) ITU-T (Telecommunication standardization sector) Study Group 12 (Performance, QoS and QoE) Overview of Speech Activities inITU-T Study Group 12 QoMEX’10, Trondheim, Norway Sebastian MöllerCo-Rapporteur Q.8/12

  2. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 2

  3. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 3

  4. Speech-related Questions in SG 12 (1/2)

  5. Speech-related Questions in SG 12 (2/2)

  6. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 6

  7. Subjective Quality Assessment Approaches Recommendations under Q.7/12. • Rec. P.800: Main Recommendation • Rec. P.805: Conversational Speech Quality • Rec. P.810: Modulated Noise Reference Unit • Rec. P.830: Speech Codec Assessment • Rec. P.835: Speech Quality in Noise • Rec. P.840: Circuit Multiplication Equipment • Rec. P.85: Voice Output Devices • Rec. P.851: Spoken Dialogue Systems • Rec. P.880: Time-varying Quality • Suppl. 24 to P-Series Rec.: Interaction Parameters • [Handbook on Subjective Testing Practical Procedures] 7

  8. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 8

  9. Motivation, Goals Linguist. Backgr. Experi- ence Attitude Emotions User Factors Subjective Quality Judgment Transmission System Estimated Quality Index Model System Parameters Quality Prediction ApproachesSpeech transmission services. Speech Signals

  10. Quality Prediction ApproachesTaxonomy of prediction models. Input information: • Signals • one or two signals • acoustic or electric • Parameters • Protocol information • Combinations hereof Measurement of input information: • Online • Offline • Estimation Output information: • Listening-only • integral quality • quality features • Conversational • Talking-only Application area: • Planning • Set-up and optimization • Monitoring Network type: • Narrowband • Wideband

  11. Output Information Quality Prediction ApproachesTaxonomy of prediction models: Narrowband case. Input Information

  12. Output Information Quality Prediction Approaches Taxonomy of prediction models: Wideband case. Input Information

  13. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 13

  14. Tasks of Q.9/12 Overview. • New model for overall speech quality (P.OLQA) • New models for degradation decomposition (P.AMD, P.TCA) • New model for prediction of P.835 scores (P.ONRA) • Methods for talking quality prediction • Models for audio signals (e.g. music) transmitted over telecommunication links like GSM or VoIP • Models for synthesized speech quality • Models for video quality (restriction to low bit-rate coding and limited image sizes) 14

  15. Quality Prediction ModelsSignal-based models. Reference-based approach: x’(k) Clean speech signal Pre- Processing Internal Represent. Distance Average Transform. Transmission System MOS Pre-Processing Internal Represent. y(k) y’(k) (e.g. ITU-T Rec. P.862, 2001; Hauenstein, 1997; Hansen & Kollmeier, 1997)

  16. Quality Prediction ModelsSignal-based models. Internal Representation: Specific Loudness Power Excitation Filter Bank x‘(k) Temporal Masking Compression Spectral Masking TP x2 (Hauenstein, 1997)

  17. Quality Prediction ModelsP.OLQA, P.AMD and P.TCA. Pre- Processing Internal Represent. Transmission System Comparison Integration Transform. Pre- Processing Internal Represent. Discontinuity Indicator Noisiness Indicator Coloration Indicator (Côté 2010; Wältermann et al., 2008) Loudness Indicator

  18. Quality Prediction ModelsMulti-dimensional approaches. F1: Directness/ frequency content F2: Continuity F3: Noisiness noisy F3 not noisy continuous direct, bright F2 F1 (Wältermann et al., 2006) interrupted indirect, dark

  19. Overview • Speech-related questions • Subjective quality assessment approaches • Quality prediction approaches • Tasks of Q.9/12: Signal-based models • Tasks of Q.8/12: Parametric models 19

  20. Tasks of Q.8/12 Overview. • Wideband and mixed-band transmission scenarios • Terminal equipment other than standard handset telephones (e.g. HFTs, headsets) • Degradations introduced by speech-processing devices (e.g. EC, VAD, NR) • Use of the E-model for quality monitoring • Perceptual dimensions other than “impairment”, i.e. “speech sound quality” and conversational quality • Additivity property of the E-model • Coverage of user expectation, development of user expectation over time 20

  21. IP WAN Quality Assessment and Prediction E-model for narrowband networks. Linear distortion, delay Backgr.noise,acoustic coupling Codec Jitterbuffer, VAD Packetloss Talker echo, listener echo Circuit noise Backgr.noise,acoustic coupling

  22. SLR, RLR, Ta IP WAN Ps, Ds, STMR Ie, qdu Bpl Ppl TELR, T, WEPL, Tr Nc, Nfor Pr, Dr, LSTR Quality Assessment and Prediction E-model for narrowband networks. Impairments SNR simultaneous delayed nonlin./timevar. Overall quality R = Ro - Is - Id - Ie,eff Estimated user judgment MOS = f (R )

  23. Quality Assessment and Prediction E-model extension for wideband networks. Ro,max = 129 (Raake, 2006; Appendix II, ITU-T Rec. G.107, 2006)

  24. Quality Assessment and Prediction E-model extension for wideband networks. Ro,max = 129 AMR-WB (23.05) AMR-WB (6.6) (Raake, 2006; Möller et al., 2006)

  25. Thank you for your attention! Further information can be found under www.itu.int/ITU-T/studygroups/com12 25

More Related