1 / 59

Speech & Hearing Perception

Speech & Hearing Perception. Perry C. Hanavan. Recommendation. My Fair Lady (musical adaption) Pygmalion. Review. Peripheral Auditory Mechanism Outer ear (pinna & external auditory canal) Acoustic transmission Quarter wave resonator Middle ear (TM, ossicles, Eustachian tube, tympanum)

freedman
Download Presentation

Speech & Hearing Perception

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Speech & Hearing Perception Perry C. Hanavan

  2. Recommendation • My Fair Lady (musical adaption) • Pygmalion

  3. Review • Peripheral Auditory Mechanism • Outer ear (pinna & external auditory canal) • Acoustic transmission • Quarter wave resonator • Middle ear (TM, ossicles, Eustachian tube, tympanum) • Mechanical transmission/transduction • Inner ear (cochlea, semicircular canals, saccule, utricle) • Hydraulic transmission/transduction • Mechanical transduction • Auditory Nerve (afferent, afferent) • Chemical-electrical transmission

  4. Outer Ear • Pinna • External auditory meatus • Quarter wave resonator • The resonant frequency of the average adult ear canal is about 3000 Hz. • Smaller ear canals, like in children, have higher resonant frequencies around 4000 Hz

  5. Localization • 2 ears • The two most important localization cues are the interaural time difference, or ITD, and the interaural intensity difference or IID. • Head shadow effect of the sound wave: a sound coming from a source located to one side of the head will have a higher intensity, or be louder, at the ear nearest the sound source. • Phase differences also plays a role in localization

  6. Middle Ear • Impedance mismatch • Air vs. fluid • Area ratio hypothesis • Lever hypothesis (3:1) • Stiffness and mass have inverse effects on frequency in a resonant system: f=(1/2p)               • Mass dominated systems have a lower resonant frequency than stiffness dominated systems. • Increasing stiffness in any ear component (membranes, ossicles, cavity) improves the efficiency of transmission of high frequencies. • Adding mass to the system, e.g., by increasing cavity volume or increasing ossicular chain mass, favors low frequencies.

  7. Middle Ear

  8. Inner Ear • The cochlea is a fluid-filled spiral with a resonator, the basilar membrane, and neuroreceptor, the Organ of Corti • Inner ears are tuned in that inner ear stiffness and mass characteristics are major determinants of hearing ranges • Differences in hearing ranges are dictated largely by differences in stiffness and mass of the basilar membrane that are the result of basilar membrane thickness and width variations along the cochlear spiral. • Basilar membranes are essentially tonotopically arranged resonator arrays, ranging high to low from base to apex.

  9. Basilar Membrane

  10. Traveling Wave • http://www.lloydwatts.com/collaborators.shtml

  11. Inner vs. Outer Hair Cells

  12. Inner Ear Mechanics • Basilar membrane • Basilar membrane animations • Hair Cells • Outer Hair Cell Motility

  13. Central Auditory Path

  14. CNS • Cochlear nuclei (modulate motility of OHC, acoustic reflex) • Trapezoid Body • Superior Olivary Complex (reflexes centers for Moro, startle, auralpalpebral, acoustic reflexes) • Lateral Lemniscus • Inferior Colliculus • Medial Geniculate Body • Primary Auditory Cortex • Wernicke-s Area • Corpus Callosum

  15. Auditory CNS Path Central Auditory Pathway

  16. Excellent Brief Review • Review of Function

  17. Hearing Threshold

  18. Auditory Masking • Blocking or obscuring a sound • Simultaneous masking • Presentation of target sound and masking sound • Broadband Noise (BBN) vs Narrowband Noise (NBN) • Critical bandwidth (when using NBN) • Upward spread of masking • Central masking

  19. Precedence Effect • Fusion of sounds and initial echoes into one auditory event and the localization of that fused sound at the source of the earliest arriving sound • Stenger test used by Audiologists using this effect when individual suspected of malingering

  20. Equal Level Contours

  21. Music Analysis • Pure Tone (Periodic) • Periodic Complex Tone • Aperiodic Complex (Noise) • Fundamental – lowest tone in complex periodic sound • Harmonics – whole number multiple of fundamental • Missing fundamental – auditory illusion

  22. Fundamentals • 100, 200, 300 Hz (100) • 800, 900, 1000 Hz (100)

  23. Frequency • Place principle: Helmholtz suggested the basilar membrane resonate in specific places to a tone which Bekesy confirmed later • Frequency principle: Seeback and revived by Wever, suggested that the spike potentials of auditory nerve determines pitch • Volley principle: neurons fire in groups while one neuron is reloading another is firing

  24. Auditory Scene Analysis ASA: a concept created by Albert Bregman, is a process in which the auditory system takes the mixture of sound that it derives from a complex natural environment and sorts it into packages of acoustic evidence in which each package probably has arisen from a single source of sound. This grouping helps pattern recognition not to mix information from different sources. Online Examples • Compact disc of ASA Link • Segregating and Grouping

  25. Speech Production • Formants

  26. Speech Production • Phonemes (sound units of language) • Consonants (s, z) • Voiced (b, d, g) • Unvoiced (p, t, k) • Vowels (a, e, o, i, u) • Diphthongs (oy, ei)

  27. Formants • Vowels • Greater intensity, formant structure, all voiced, constriction of air flow less than consonant • Diphthongs • Vowel characteristics, but transition (glide) • Consonants • Less intensity, greater constriction of air flow

  28. Pattern Playback Haskins Laboratory

  29. Vocal Tract • Approximately 17 cm for males • 5/6 the length for females • Children roughly half the length of adult male

  30. Math Model for Vowel Formants • Formant Calculation Handout • Formant Plotting Handout • Excel Model

  31. Source Filter Fo (source produced at vocal folds) Formants (F1, F2, F3, …) created by vocal tract resonance Source which is emphasized and not modulated by vocal tract resonance (F1, F2, F3, shown at left)

  32. Perception of Vowels • /a/ vowel has greatest intensity with unvoiced /θ/ as weakest vowel • Front vowels perceived on basis of F1 frequency and average of F2 and F3, whereas back vowels are perceived on the basis of the average of F1 and F2, as well as F3 • So is it the absolute frequency values of the formants? • Or the ratio of F2 to F1? • Perhaps it is the invariant cues (frequency changes that occur with coarticulation F1/F2 F3 F1 F2/F3

  33. Formant with Tongue Position More pictorials

  34. Vowel Spectrograph

  35. Chart Vowel Formants • Acoustics and Tongue Position • Video Clip

  36. Lip Rounding

  37. Vowel Formants

  38. Online Examples of Formants • Sound to Graph • Spectral Cues Homepage

  39. Perception of Diphthongs • Perceived on basis of their formant transitions • Salient feature: rapidity of transition

  40. Diphthongs

  41. Consonants • Perception different for consonants than vowels • Greater variety of consonant types than vowels • Greater complexity for consonants

  42. International Phonetic Alphabet(consonants) • 26 letters of alphabet • abcdefghijklmnopqrstuvwxyz • Only list phonemes • bdfghjklmnprstvwz • Digraph phonemes • ch, sh, th • Other phonemes

  43. Production of Consonants • Place of production • Where major constriction occurs in vocal tract • Manner of production • How consonant is produced • Voicing • Voiced or unvoiced

  44. Place of Production Example of some consonant phonemes: • Bilabial p b m w • Labiodental f v • Dental th • Alveolar t d s z l r • Palatal ch sh • Velar k g ng • Glottal h

  45. Manner of Production Example of some consonants: • Stops p t k b d g • Fricatives f v s sh z h • Affricates ch dg • Nasals m n ng • Semivowels w l r j

  46. Voicing Examples of some consonants • Voiced b d g v z l r w • Unvoiced p t k f s h

  47. Stops • Produced with a closure within the oral cavity, a build up of pressure behind this closure and a release of the closure allowing the air to be rapidly expelled. • Acoustically these events can be divided into five components: • Occlusion • Transient • Frication • Aspiration • Transition • More info

  48. Fricatives Fricative production involves two articulators being brought together and held close enough for the escaping air to become turbulent creating an aperiodic (noise) sound. Maybe be voiced or unvoiced. The closure phase of fricatives is characterized by the continuant noisy aperiodic component. The characteristics of the noise are the result of the position of the constriction, the shape of the orifice, and the aerodynamic forces of the air stream. Acoustic characteristics include: High frequency hiss, long duration, weak to moderate intensity

  49. Affricates • Stop with a fricative release – but palatal. • Combination of stop and fricative characteristics. • Closure, burst followed by short silence then frication • The affricates can be distinguished from the fricatives by the presence of closure and by the duration of noise which is longer for the fricatives. • The shorter the duration of noise, the shorter the silence necessary to elicit an affricate response. • Affricates have a shorter rise time than fricatives. Rise time is the time from onset to peak intensity of frication.

  50. Nasals • Like the oral tract, the nasal tract has its own resonant frequencies or formants. • The most commonly reported nasal formants occur at 300Hz, 1kHz, 2.2 kHz, 2.9kHz, 4kHz. • Antiresonances enter whenever there is a side branch in the main acoustic pathway. An antiresonance or zero serves to decrease the spectral energy at specific frequencies by absorbing the sound at or near the antiresonant frequencies. These cumulatively have the effect of reducing the total amplitude of the sound generated.

More Related