550 likes | 857 Views
Corpus annotation and analysis using Praat. Marc Brunelle University of Ottawa marc.brunelle@uottawa.ca Tr ường Đại Học Khoa Học Xã Hội và Nhân Văn, TP. Hồ Chí Minh, June 22 th , 2013. Outline. Summary of spectrogram reading, with an emphasis on consonants Summary of basic Praat functions
E N D
Corpus annotation and analysis using Praat Marc Brunelle University of Ottawa marc.brunelle@uottawa.ca Trường Đại Học Khoa Học Xã Hội và Nhân Văn, TP. Hồ Chí Minh, June 22th, 2013
Outline • Summary of spectrogram reading, with an emphasis on consonants • Summary of basic Praat functions • Corpus annotation in Praat • Automatic analysis of prosody in Praat
Vowels • Formants tell us about the shape of vocal tract during vowel production • Size of back cavity • Size of front cavity • The first three formants are usually enough to determine the quality of a vowel
First Formant (F1) • F1 correlates with vowel height • Low vowel = High F1 • Male = 750 Hz, Female = 850 Hz • High vowel = Low F1 • Male = 250 Hz, Female = 300 Hz • This is a pretty robust correlation
Second Formant (F2) • Second formant correlates with vowel frontness/backness • Low F2 = Back vowel • Male = 850 Hz, Female = 900 Hz • High F2 = Front vowel • Male = 2300 Hz, Female = 2800 Hz • However: • Lip rounding affects F2 • Vowel height has some effect on F2 as well
F1 and F2 i e ɛ ɨ ə a u o ɔ
Third Formant (F3) • F3 is typically higher for front vowels, but doesn’t vary much between central and back vowels • However, this is far from being systematic
Diphthongs • Some vowels are composed of a steady state and a glide • VN real diphthongs ia, ua, ưa • Sometimes, there is no or very little steady state, just gliding • Let’s look at VN diphthongs in Praat
Semi-vowels (glide) • On a spectrogram, semi-vowels look like vowels • They do not have a stable state, they just glide • The difference between vowels and semi-vowels is phonological, not phonetic • VN false diphthongs ai, ay, ao, au, iêu, oeo. (q)uơi…
Closures and releases • Closure: Silence • Release: Burst • Zero VOT: no lag between release and vowel • Praat: look at your own bursts • Double burst in velars
Voice • On a wide-band spectrogram, voiced stops are identified by: • their voice bar (negative VOT) • their vertical striations
Aspiration • Noise between the release and the vowel • Positive VOT
Places of articulation • No way of identifying the place of articulation from the closure • The cues are: • In the vowel transitions • In the aspiration (if any) • Release
Transitions • Locus • Apparent point of origin of the formants • Velars have two loci • F3 transitions are less important, but not negligible
Textbook example of transitions • Note the velar pinch
Importance of transitions • Same transitions in oral and nasal stops • The place of articulation of a stops (oral or nasal) can be retrieved from its transitions only
Aspiration • Frequency of noise is a good cue for place of articulation (not much use in VN) • Front cavity determines the frequency of noise • Front stops: high noise frequency • Backs stops: low noise frequency • No high-energy frequency range for labials because of the absence of a front cavity
Release • Velar stops often have a double burst • Typically, Vietnamese coda stops are not released, so only useful for onsets
Places of articulation 2500 Hz 4000 Hz
Noise frequency • Front fricatives have higher frequencies than back fricatives • Short front cavity: Low frequency • Long front cavity: High frequency • However, labio-dentals and interdentals have no front cavity, so no clear peak
Transitions • Fricative-Vowel transitions exist, but • Weaker than stop-vowel transitions • Mostly seen on F2 • Locus is the peak noise frequency of the fricative
Northern VN fricatives f v s z x ɣ
Affricates • Stop closure + fricative • No real burst • Not always
Nasals • Nasal stops • Closure in the mouth • Airflow through the nasal cavity
Nasal formants • The nasal cavity is like an unconstricted tube • Nasal formants are roughly the same for all places of articulation (they depend on the length and shape of the nasal cavity) • 250-500 Hz • 1750-2250 Hz • 2750-3250 Hz
Anti-formants • As the oral cavity is closed, some waves are absorbed. This is anti-resonance. • Anti-resonance (or anti-formants) depends in theory on the oral articulation • In practice, it is almost impossible to determine the place of articulation based on anti-formants • Overall, anti-formants reduce the amplitude of nasal formants
Nasal-Vowel transitions • Loci are the same as for oral stops • Same place of closure • Same effect on speech signal once the nasal cavity is closed • Transitions can be difficult to see in nasalized vowels, because they have both oral and nasal formants
Approximants • Approximants have formants • However, • These formants are more changing than vowel formants • The intensity of these formants is an important cue
Liquids • Formant structures similar to vowels • 500 Hz, 1,500 Hz, 2,500 Hz • Anti-formant • Because of the tongue closure, there is an anti-resonance around 2000 Hz
Southern VN approximants lu ru
Trills and taps • Usually, one can see interruptions in the spectrogram because trill vibration and taps correspond to brief closures
Pitch (and tone)
Pitch • Pitch corresponds to vocal fold vibrations • Thus, anyvoicedsoundcan have a pitch • In Vietnamese, tones tend to berealized on the entirerhyme • However, consonants have an allophoniceffect on pitch, and thus on tones
A Cham example • Story NVT • Designed to test the effects of intonation in Cham • Several tiers • See textgrid
Segmentation • Segment the Sun and Wind story (recorded by James Kirby) in small teams • In IPA (Lenz) • phoneme by phoneme • Files • 01a (3 sec) – together? • 01b (4 sec) • 01c (9 sec) • 01d (11 sec) • 01e (5 sec) • 01f (6 sec)
Glosses and translation • Normal practice • Native script (quốc ngữ) • Word for word glosses under the transcription • Full translation, sentence by sentence