1 / 13

Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech

Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech. F. Bettens * , F. Grenez * , J. Schoentgen *,** * Université Libre de Bruxelles ** National Fund for Scientific Research Belgium. Existing Cues of Vocal Noise.

nessa
Download Presentation

Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Assessment of Vocal Noise via Bi-directional Long-term Linear Prediction of Running Speech F. Bettens*, F. Grenez*, J. Schoentgen*,** *Université Libre de Bruxelles **National Fund for Scientific Research Belgium

  2. Existing Cues of Vocal Noise • Detection of individual vocal cycles(or harmonics) • Steady vowel fragments • (Pseudo)-Periodicity • Period Perturbation Quotient • Amplitude Perturbation Quotient • Harmonics-to-Noise Ratio

  3. Objectives : Analyses of Dysperiodicities • Give up request that speech fragments are : • (Pseudo)-Periodic • Steady • Any Speech Fragment : • Modal Voices & (Very) Hoarse Voices • Sustained Vowels & Running Speech

  4. Motivation : Analysis of Running Speech • Voicing in running speech • Variable acoustic impedance • Voicing onsets & offsets • Variable pressure drops • Variable laryngeal positions • Voice Loading

  5. Double Linear Predictive Analysis • Conventional short-term linear prediction: • Long-term linear prediction: remove existing correlations  unpredictable noise component (Qi, 1999) forward short-term prediction error forward double prediction error

  6. Double Linear Predictive Analysis  Solutions:  remove short-term linear predictive analysis stage  proceed to bi-directional analysis Drawbacks: • eS[n] is an artificial signal • the dysperiodicities in weighted sum x[n] are omitted • eL[n] is inflated to the right of unvoiced/voiced boundaries

  7. Bi-directional Long-term Prediction • Forward long-term linear prediction: • Backward long-term linear prediction: • Bi-directional long-term linear prediction: keep the “best” (frame by frame) forward long-term prediction error backward long-term prediction error bi-directional long-term prediction error

  8. Long-term Prediction Distance : P Maximum of the auto-correlation function example: steady vowel [a] (dysphonic speaker)   P = 184 (2 cycles)

  9. Vocal Noise Cue Signal-to-Dysperiodicity Ratio: example: steady vowel [a] healthy speaker dysphonic speaker speech signal x[n] bi-directional long-term prediction error eL[n] SDR = 31,2 dB SDR = 10,1 dB

  10. Results1:Sentence(1 female speaker; modal phonation type)(http://www.limsi.fr/VOQUAL/ : “Il est sorti avant le jour”) segments [il] speech signal bi-directional long-term prediction error forward long-term prediction error

  11. Results 2 : Sentence (1 female speaker; 5 phonation types)(http://www.limsi.fr/VOQUAL/ : “Il est sorti avant le jour”)

  12. Conclusion The forward & backward long-term prediction of speech enables the analysis of any speech signal with a view to the assessment of the vocal noise (i.e. vocal dysperiodicities) The analysis is not based on any assumptions regarding the periodicity or stationarity of the speech signals

More Related