1 / 41

Voice source characterisation

Research focusing on physiology, acoustics, and perception of voice source for speech synthesis, speech recognition, and diagnostics of voice pathologies. Methods include acoustic measurements, vocal intensity range, and vocal registers analysis.

winnieh
Download Presentation

Voice source characterisation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Voice source characterisation Gerrit Bloothooft UiL-OTS Utrecht University

  2. Voice research To describe and model the properties of the vocal sound source from view points of: • Physiology • Acoustics • Perception Voice Source Characterization

  3. Importance of the voice • Speech synthesis • Towards natural sounding synthesis • Speech recognition • Using source properties in recognition • Speaker recognition/identification • Voice source characteristics are essential • Diagnosis • Pathologies, voice classifications Voice Source Characterization

  4. Voice possibilities Limited use of voice in speech • Range of the fundamental frequency • Vocal intensity range • Spectral variation Voice Source Characterization

  5. Focus in this presentation How do acoustic voice source characteristics vary as a functionof F0 and vocal intensity Voice Source Characterization

  6. Voice profile measurement Thirties: Intensity range as function of various pitches • manual measurement Eighties: Automatic computation ofF0 and Intensity • computer measurement • visual feedback • additional parameters Voice Source Characterization

  7. Measurement unit • One decibel • One semi-tone Voice Source Characterization

  8. Measurement procedure • Subject in front of computer screen • Microphone on head set (30 cm) • Just phonate, sing, and see the result immediately • Best results with recording protocol • Feed back stimulates extreme phonations Voice Source Characterization

  9. Voice profile / density Vocal Intensity (dB SPL) Sample density Fundamental frequency (Hz) Voice Source Characterization

  10. Voice profile / speech area Vocal Intensity (dB SPL) Sample density Fundamental frequency (Hz) Voice Source Characterization

  11. Acoustic voice quality parameters • Jitter • Stability of periodicity • Asymmetry in vocal folds • Crest factor • Max amplitude divided by average energy • Relates to spectral slope • Many more … Voice Source Characterization

  12. Crest factor Vocal Intensity (dB SPL) Crest factor Fundamental frequency (Hz) Voice Source Characterization

  13. Voice Source Characterization

  14. Real time presentation Screen presentation • One data point per F0-I cell Advanced data storage [new] • Full audio signal • Full distribution of data per F0-I cell • Data for screen presentation Voice Source Characterization

  15. Advantages • Reusability of recordings • Statistical analysis per F0-I cell • Study of time-varying behavior Voice Source Characterization

  16. Crest factor Vocal Intensity (dB SPL) Crest factor Fundamental frequency (Hz) Voice Source Characterization

  17. Median smoothing of crest factor Crest factor median smoothed Vocal Intensity (dB SPL) Crest factor Fundamental frequency (Hz) Voice Source Characterization

  18. Vocal Registers Different movement patterns of the vocal folds • Pulse register (creaky voice) • Modal register • Falsetto register Voice Source Characterization

  19. Pulse register • Less than 50 Hz • Irregular • Long closed period Voice Source Characterization

  20. Pulse register Vocal Intensity (dB SPL) Fundamental Frequency (Hz) Voice Source Characterization

  21. Modal register • “Normal” use of voice • Active role of M. Vocalis • Vocal folds thick and completely vibrating • Wide range in F0 and intensity • Flat spectrum Voice Source Characterization

  22. Modal register Vocal Intensity (dB SPL) Fundamental frequency (Hz) Voice Source Characterization

  23. Falsetto register • Higher pitches • M. Vocalis passive, tense vocal ligaments through M.Cricothyroidus • Edge vibration of vocal volds • Sound poor in higher harmonics (in untrained subjects) Voice Source Characterization

  24. Falsetto register Vocal Intensity (dB SPL) Fundamental frequency (Hz) Voice Source Characterization

  25. Register overlap Vocal Inensity (dB SPL) Fundamental frequency (Hz) Voice Source Characterization

  26. Chest- en head voice Refer to secundary vibratory sensations in the body • Chest voice: loud modal register • Head voice: • males: higher, softer modal register in overlap area with falsetto register • women: falsetto register Voice Source Characterization

  27. Chest voice and Head voice chest Vocal Intensity (dB SPL) head Fundamental frequency (Hz) Voice Source Characterization

  28. Registers and voice profiles With a description using • Iso-crest factor lines • Iso-jitter lines Voice Source Characterization

  29. Iso-crest factor lines 6 dB Vocal Intensity (dB SPL) Crest factor 4 dB Fundamental frequency (Hz) Voice Source Characterization

  30. Iso-jitter lines Vocal Intensity (dB SPL) Jitter (%) 3 % Fundamental frequency (Hz) Voice Source Characterization

  31. New representation • Areas defined by iso-parameter lines • crest factor < 4 dB • crest factor > 4 dB, < 6 dB • crest factor > 6 dB • jitter < 3 % • [relative rise time < 6 %] Voice Source Characterization

  32. Areas in the phonetogram RRT < 6 % pressed-like Crest factor < 4 dB sine-like Vocal Intensity (dB SPL) Jitter > 3%, unstable Fundamental frequency (Hz) Voice Source Characterization

  33. Vocal registers in the phonetogram Chest voice boundary Vocal Intensity (dB SPL) Falsetto upper boundary Modal lower boundary Fundamental frequency (Hz) Voice Source Characterization

  34. Comparison of voice profiles Characterisation of • Voice pathologies • Voice classifications Reuse stored voice profiles of subjects with known voice history Voice Source Characterization

  35. Important features • Contour has limited value • but most research goes into that direction (norm profiles) • Distribution of acoustical parameters across the voice profile tells much more Voice Source Characterization

  36. We need • Unit for comparison • Voice profile unit defined by small range of F0 and Vocal Intensity • Distributions of acoustic voice parameters per unit • Probability density function per parameter • Model • Hidden Markov Model Voice Source Characterization

  37. Unit model • two unconnected states per phonetogram unit • vocal registers • start and end of phonetion Voice Source Characterization

  38. Correspondences • Speech Voice Profile • phoneme model F0/I unit model • not labeled labeled by F0 and I • spectral envelope acoustic voice parameters • language model unrestricted transitions • “forced alignment • recognition” Voice Source Characterization

  39. Crest factor distributions Voice Source Characterization

  40. Most distinctive states Vocal Intensity (dB SPL) Distinctiveness Fundamental frequency (Hz) Voice Source Characterization

  41. Conclusions • Voice profiles can enhance our understanding of vocal behaviour in a visually attractive way • Current data storage opens a series of important research topics • Market opportunities for “light” versions Voice Source Characterization

More Related