1 / 19

3-D Sound and Spatial Audio

3-D Sound and Spatial Audio. MUS_TECH 348. Wightman & Kistler (1989). Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical validation. I. Stimulus synthesis. Goal is to be able to capture free-field listening acoustics with headphones. • 200-14,000 Hz

baakir
Download Presentation

3-D Sound and Spatial Audio

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 3-D Sound and Spatial Audio MUS_TECH 348

  2. Wightman & Kistler (1989) Headphone simulation of free-field listening I. Stimulus synthesis II. Psychophysical validation

  3. I. Stimulus synthesis Goal is to be able to capture free-field listening acoustics with headphones. • 200-14,000 Hz • Greater than 20 dB S/N (only 20 dB?) • 8 loudspeakers on movable arch creating 144 directions • With & without bite bar Measure loudspeaker-delivered HRTFs and compare to headphone-delivered HRTFs

  4. HRTF measurement system

  5. Variability in HRTF measurements Left ear Assembly replaced 10 times with bite bar Right ear Left ear Assembly left in place with no bite bar Right ear Headphone replacement with assembly in place

  6. HRTF intersubject variability

  7. II. Psychophysical validation Goal is to compare localization performance in free-field and headphone listening Stimuli: 8 250 msec noise bursts 200 - 14,000 Hz random spectral changes by critical band Presentation: 6 loudspeakers at a time mounted on arch headphones 72 positions Task: absolute judgment of azimuth and elevation no measure of distance or quality

  8. Types of Errors • Angle error (mean of difference angles) • Judgment centroid (average direction) • Dispersion of judgments • Front-back reversals are removed! (and examined separately) Results • Substantial individual differences • Less obvious in global measures • Most evident in elevation judgments • Performance varies with region • Best localization: side (contradicts other studies) • Worst localization: top rear • Free-field and headphone judgments very similar • More front-back reversals with headphones

  9. Headphone simulation data in parentheses

  10. SDE has most errors. SDO has fewest errors, especially for elevation.

  11. Elevation Dependency Function Interaural intensity difference compared to 0-degrees elevation Subject SDE’s poor elevation judgments could be explained by the lack of a coherent pattern

  12. Begault: Challenges to the Successful Implementation of 3-D Sound • Focus is on deployable systems, especially audio systems • Individual HRTFs can be quite different • Challenges: • Eliminate front-back reversals & improve externalization • Reduce HRTF data load • Resolve conflicts in data specifications

  13. Begault: Challenges to the Successful Implementation of 3-D Sound Mismatch of Specification and Performance Success depends on: HRTFs: some work better than others different sets create timbral percepts Input sounds broadband sounds localize better Specification Have reasonable expectations What kinds of HRTFs to use for systems? General HRTFs designed for average listeners HRTFs of good localizer

  14. Reality vs Ideal From Begault and Wenzel, 1993

  15. Begault: Challenges to the Successful Implementation of 3-D Sound • Localization error • For dummyhead recordings, 30% of locations suffer reversals • 4:1 front-back vs back-front • Many sounds not externalized • Low-frequency Response Errors • Measurement equipment can’t get it right • Data-reduction for HRTFs • Reduce the number of coefficients • Alternative Strategies like pole-zero modeling

  16. Martens: Perceptual evaluation of filters controlling source direction: Customized and generalized HRTFs for binaural synthesis Focus is on systems supporting directional hearing with special consideration on HRTF design Position of sound source and position of auditory event do not always coincide, but that is not necessarily an issue of accuracy Sound localization might better be called space perception

  17. Martens: Perceptual evaluation of filters controlling source direction: Customized and generalized HRTFs for binaural synthesis Binaural Synthesis Good localizer HRTFs not supported by evidence Given the variety of approaches to binaural synthesis, better to use the term Directional Transfer Functions (DTFs) when they are created analytically Target Exact Analytic One Individualized HRTFs Customized DTFs Many Averaged HRTFs Generalized DTFs Performance evaluation (in additional azimuth and elevation); Externalization Range Coherence Naturalness

  18. Martens: Perceptual evaluation of filters controlling source direction: Customized and generalized HRTFs for binaural synthesis Binaural Synthesis Evaluation What features are needed to make binaural synthesis “ear adequate” Binaural cues can be based on analysis and selected resynthesis Principle Components Analysis (PCA) Selective Reconstruction (for example, leaving out phase information [Pole-zero design] Elevation judgments needed only three out of four cues: ipsilateral magnitude interaural magnitude ipsilateral phase interaural phase

  19. Martens: Perceptual evaluation of filters controlling source direction: Customized and generalized HRTFs for binaural synthesis Customizing HRTFs Calibration methods: Anthropometric (anatomy) Acoustic (HRTFs) Psychophysical (perception) Source Range Ipsilateral gain and contralateral attenuation are important

More Related