The Effect of Incongruent Visual Cues on the Heard Quality of Front Vowels

Hartmut Traunmüller, Niklas Öhrström. Dept. of Linguistics, University of Stockholm.

Presentation Transcript


  1. The Effect of Incongruent Visual Cues on the Heard Quality of Front Vowels. Hartmut Traunmüller, Niklas Öhrström. Dept. of Linguistics, University of Stockholm.

  2. Background We previously carried out an audiovisual (AV) perception experiment in which congruent and incongruent AV stimuli were presented to subjects. The stimuli consisted of different front vowels presented within a [g_g] frame. In the incongruent stimuli, the auditory and visual vowels differed in openness (height), roundedness, or both. The subjects had to report which vowel they had heard. The response alternatives were the nine letters that represent the long vowel phonemes of Swedish.

  6. Background Typical result: the heard vowel combined the visually presented roundedness with the auditorily presented openness.

  7. Background Explanation: Acoustic cues to openness (F1 etc.) are prominent and reliable. Acoustic cues to roundedness (higher formants) are less reliable. Optic cues to roundedness are prominent and reliable; rounded lips are easy to distinguish from unrounded. Optic cues to openness are less reliable because of variation due to individual habits, attitude and emotion.

  9. Background That experiment was designed to investigate categorical phonemic perception. However, subjects informally reported having heard vowels whose quality differed from that of ordinary Swedish vowels. Auditory rounding together with visual unrounding appeared to affect the heard backness of the vowel.

  11. The present study The present experiment aims at exploring the effect of the optic signal on the finer phonetic, sub-categorical auditory perception of vowels.

  12. The present study We reused a subset of the stimuli from the previous experiment.

  13. The present study There were 4 speakers: 2 male, 2 female.

  14. The present study There were 8 perceivers, selected from a previous experiment in which they had shown sensitivity to the optic signal in incongruent audiovisual stimuli. All 8 were phonetically skilled and familiar with the IPA chart for vowels.

  15. The present study The subjects perceived the stimuli by way of headphones and a computer screen. The stimuli were presented in quasi-random order. Responses were given on electronic response sheets.

  16. The present study The subjects were instructed to rate the dimensions of the heard vowel (or of those seen in purely optical stimuli):
      • Lip rounding (6 degrees), 1st: unrounded; 5th: rounded
      • Lip spreading (3 degrees)
      • Openness (18 degrees), 2nd: close vowels; 6th: close-mid vowels
      • Backness (11 degrees auditorily; 7 degrees visually), 2nd: front vowels; 6th (auditorily): central vowels

  17. Results Openness opn vs. roundedness rnd; acoustic stimuli (listening only):

  18. Results Openness opn vs. roundedness rnd; optic stimuli (lipreading only):

  19. Results Openness opn of incongruent AV-stimuli vs. opn of A-stimuli: opn = 0.05 + 1.00 opn_A (r² = 0.97)
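
The relation reported on this slide has the form of an ordinary least-squares line with a coefficient of determination. A minimal Python sketch of how such an intercept, slope and r² can be obtained from paired ratings is given here. The function name `fit_line` and the sample numbers are hypothetical and serve only as an illustration; they are not the study's data, and the slides do not say which software the authors used.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = a + b*x; returns (a, b, r2)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each vary")
    b = sxy / sxx                    # slope
    a = mean_y - b * mean_x          # intercept
    r2 = (sxy * sxy) / (sxx * syy)   # squared Pearson correlation
    return a, b, r2


# Hypothetical ratings (NOT the study's data): openness rated for the
# acoustic stimuli alone and for the incongruent AV stimuli.
opn_a = [0.1, 0.3, 0.5, 0.7, 0.9]
opn_av = [0.16, 0.34, 0.56, 0.74, 0.96]

intercept, slope, r2 = fit_line(opn_a, opn_av)
print(f"opn = {intercept:.2f} + {slope:.2f} * opn_A (r2 = {r2:.2f})")
```

A slope near 1 with a high r², as reported on the slide, means that the openness heard in the AV stimuli follows the openness of the acoustic signal almost exactly.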

  20. Results Roundedness rnd of incongruent AV-stimuli vs. rnd of A-stimuli: (no significant correlation)

  21. Results Backness bac of incongruent AV-stimuli vs. rnd of A-stimuli: bac = 0.06 + 0.24 rnd_A (r² = 0.66); bac = 0.06 + 0.25 rnd_A − 0.20 rnd_AV (r² = 0.74)
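
To make the two-predictor equation on this slide concrete, the sketch below evaluates it for two hypothetical stimuli. It assumes that the A term is the roundedness rated for the acoustic stimulus alone, that the AV term is the roundedness rated for the audiovisual stimulus, and that ratings lie on a normalised 0 to 1 scale; the slide states none of this explicitly. The function name `predicted_backness` is made up for the example.

```python
def predicted_backness(rnd_a, rnd_av):
    """Evaluate the slide's two-predictor equation.

    rnd_a  -- roundedness rated for the acoustic stimulus (assumed 0..1)
    rnd_av -- roundedness rated for the audiovisual stimulus (assumed 0..1)
    """
    return 0.06 + 0.25 * rnd_a - 0.20 * rnd_av


# Hypothetical cases, both with a fully rounded acoustic vowel:
heard_unrounded = predicted_backness(rnd_a=1.0, rnd_av=0.0)
heard_rounded = predicted_backness(rnd_a=1.0, rnd_av=1.0)

print(f"heard as unrounded: bac = {heard_unrounded:.2f}")
print(f"heard as rounded:   bac = {heard_rounded:.2f}")
```

Under these assumptions, a vowel that is acoustically rounded but heard as unrounded receives a higher predicted backness rating than one heard as rounded, in line with the informal observation reported in the Background slides.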

  22. Results Openness opn of incongruent AV-stimuli vs. opn of V-stimuli: (no significant correlation)

  23. Results Roundedness rnd of incongruent AV-stimuli vs. rnd of V-stimuli: rnd = 0.05 + 0.82 rnd_V (r² = 0.92); rnd = −0.03 + 0.86 rnd_V + 0.47 bac_V (r² = 0.95)

  24. Results Backness bac of incongruent AV-stimuli vs. rnd of V-stimuli: (significant negative correlation)

  25. Discussion Rated backness in incongruent stimuli is correlated with roundedness in the stimuli. There are two hypothetical explanations for this:
      • The distance from the lips to the dorso-palatal 'place of articulation' is increased by lip rounding as well as by tongue retraction. This would provide an articulatory (gestural) explanation.
      • F2' is lowered by lip rounding as well as by tongue retraction. This would provide an auditory explanation.
      Both explanations would be consistent with the placement of the rounded vowels to the right of their unrounded counterparts in IPA charts.

  29. Discussion Analysis of perceived backness

  31. Discussion Analysis of perceived backness. Conclusion: The effect is due to auditory (F2') rather than articulatory (gestural) associations.

  32. Discussion We have seen that the perceived retractedness of auditorily rounded but visually unrounded vowels can be understood as due to a continuous auditory variable (F2'). The variation in perceived retractedness cannot be explained by a late-integration hypothesis: such a hypothesis would require non-front unrounded vowel phonemes, which Swedish lacks. This is clear and direct evidence for early, sub-categorical integration.

  33. Thank you for your attention!
