590 likes | 878 Views
The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs The Glides The Liquids. Unit 3 outline. Source-Filter Theory: Modeling Vowels. Vowels are Voiced (shapes the glottal source spectrum) oral (velopharyngeal port closed)
E N D
The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs The Glides The Liquids Unit 3 outline SPPA 403 Speech Science
Source-Filter Theory: Modeling Vowels Vowels are • Voiced (shapes the glottal source spectrum) • oral (velopharyngeal port closed) • produced with “open” vocal tract (VT) • Poral ~ Patmos • Classified according to tongue position in VT • front/central/back • high/mid/low SPPA 403 Speech Science
Vowel Quadrilateral SPPA 403 Speech Science
Source-Filter Theory: Modeling Vowels • Mobile articulators serve to change the VT area function so that it is not constant • non-constant area function complexity for determining VT transfer function • However, VT transfer function still based on tube acoustics SPPA 403 Speech Science
Artic. Config.Area FunctionTransfer Function gain glottis lips frequency SPPA 403 Speech Science
Source-Filter Theory: Modeling Vowels • VT has an infinite number of resonances/formants • Identification of vowel quality seems most dependent upon the location of F1, F2 & F3 • These observations are based on • Studies of vowel perception • Modeling efforts which suggest F4-F6 are relatively static • the observation that glottal source spectrum rolls off with increasing frequency SPPA 403 Speech Science
Mid Central vowel F1: 500 Hz F2: 1500 Hz F3: 2500 Hz /i/ /u/ Amplitude/gain // // SPPA 403 Speech Science frequency
/i/ & /u/ have a low F1 // & // have high F1 Tongue height ~ F1 Tongue height F1 Tongue height F1 /u/ & // have low F2 /i/ & // have high F2 Tongue A-P ~ F2 Tongue front F2 Tongue back F2 Tongue “Rules” for vowel formant values SPPA 403 Speech Science
Mid Central vowel F1: 500 Hz F2: 1500 Hz F3: 2500 Hz /i/ /u/ gain // // SPPA 403 Speech Science frequency
Lip rounding (for /u/) ~ F2 Note* lip protrusion will increase the overall length of the vocal tract which will decrease all formant values Lip “Rules” for vowel formant values SPPA 403 Speech Science
Mid Central vowel F1: 500 Hz F2: 1500 Hz /i/ /u/ Amplitude/gain // // SPPA 403 Speech Science frequency
IMPORTANT • Tongue and lip rules are based on how these articulations change the VT area function (shape) • VT area function ultimately determines the VT filter properties SPPA 403 Speech Science
Tf32 example SPPA 403 Speech Science
F1-F2 values for English Vowels SPPA 403 Speech Science
Vowels: Stylized Spectrograms SPPA 403 Speech Science
Vowels: Stylized Spectrograms SPPA 403 Speech Science
/a/ - low back ( F1 F2) SPPA 403 Speech Science
/i/- high front (F1 F2) SPPA 403 Speech Science
/u/ - high back (F1 F2) SPPA 403 Speech Science
/ae/ - low front (F1 F2) SPPA 403 Speech Science
How important are F1-F3 in speech production & perception? SPPA 403 Speech Science
Sinewave Speech Demonstration Sinewave speech examples (from HINT sentence intelligibility test): SPPA 403 Speech Science
Problem • Recall - F1 = c/4l • VT length influences exact frequency location of formants • Speakers vary in their vocal tract length • men > women > children SPPA 403 Speech Science
Problem /i/ /u/ SPPA 403 Speech Science
How do we know that a child, a man and a women all say /i/, when the acoustic values of formants are quite different? SPPA 403 Speech Science
A possible answer?? children women F2 men SPPA 403 Speech Science F1
A possible answer?? • Relative locations of formants is similar across speakers even though absolute values differ • Perhaps we ‘rescale’ our expectations depending upon factors such as gender and age SPPA 403 Speech Science
Vowel articulation and vowel acoustics • Vowel quadrilateral: articulatory plane is similar to • F1-F2 plot: acoustic plane SPPA 403 Speech Science
high low front SPPA 403 Speech Science back
Vowel articulation and vowel acoustics • Vowel quadrilateral: articulatory plane is similar to • F1-F2 plot: acoustic plane SPPA 403 Speech Science
The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs The Glides The Liquids Unit 3 outline SPPA 403 Speech Science
Diphthongs • Slow gliding movement between two vowel qualities • characterized by an articulatory transition • articulatory transition = formant transitions SPPA 403 Speech Science
Diphthongs • /ai/ - “bye” • /au/ - “bough” • /oi/ - “boy” • /ei/ - “bay” SPPA 403 Speech Science
Diphthongs: /ai/ i SPPA 403 Speech Science a
Diphthongs: /au/ SPPA 403 Speech Science a u
The Vocal Tract (VT) Source-Filter Theory of Speech Production Capturing Speech Dynamics The Vowels The Diphthongs The Glides The Liquids Unit 3 outline SPPA 403 Speech Science
Glides (/w/, /j/) & Liquids (/l/, /r/) • often termed sonorants • Associated with • a high degree of vocal tract constriction • articulatory transition = formant transition SPPA 403 Speech Science
Glides (/w/, /j/) & Liquids (/l/, /r/) Degree of Constriction • Greater than vowels • Poral slightly greater than Patmos • Less than fricatives • Poral for glides/liquids < Poral for fricatives • Constriction lasts ~ 100 msec • Constriction results in a loss in energy • weaker formants SPPA 403 Speech Science
Glides (/w/, /j/) & Liquids (/l/, /r/) Transition rate • faster than the diphthongs • slower than the stops • lasts ~ 75 msec SPPA 403 Speech Science
/w/ • Place: labial • Acoustics • /u/-like formant frequencies • Constriction formant values • F1 ~ 330 Hz • F2 ~ 730 Hz • weak F3 (~ 2300 Hz) SPPA 403 Speech Science
/w/ 3000 F3 2000 F2 1000 F1 uh w ae SPPA 403 Speech Science
/j/ • Place: palatal • Acoustics • /i/-like formant frequencies • F1 ~ 300 Hz • F2 ~ 2200 Hz • F3 ~ 3000 Hz SPPA 403 Speech Science
/j/ 3000 F3 2000 F2 1000 F1 uh j ae SPPA 403 Speech Science
/j/ uh j ae SPPA 403 Speech Science
Liquids (/l/, /r/) • lateral /l/ • Retroflex /r/ • Pickett (1999) considers these consonants glides as well SPPA 403 Speech Science
/l/ • Place: alveolar • Articulatory phonetics: • tongue tip contacts alveolar ridge • Constriction is on each side of this obstruction – hence the term lateral • Vocal tract is split – not modeled with a single tube SPPA 403 Speech Science
/l/ • Acoustics • F1 ~ 360 Hz • F2 ~ 1300 Hz • F3 ~ 2700 Hz • F2 is variable and affected by vowel environment • Position in word will affect acoustic features of /l/ • Final /l/ will have a higher F1 & lower F2 SPPA 403 Speech Science