Applied Psychoacoustics Lecture 3: Masking

Applied PsychoacousticsLecture 3: Masking Jonas Braasch

From ATH to masked detection thresholds from tonmeister.ca after Zwicker & Fastl 1999 So far we have measured the absolute threshold of hearing (ATH) throughout the auditory frequency range for sinusoids. Now we would like to investigate how we detect sounds if other sounds are present as well.

Preliminary thoughts • We have seen that a sine sound does not only excite the part of the basilar membrane that corresponds to its frequency but also other frequencies as well (traveling waves). • The traveling wave move from the base (high freqs.) to the apex (low freqs.) and declines after passing the resonance frequency. • Therefore, we expect that a sound at a given frequency also affects the detection of a sound at another frequency. • We will utilize this effect to the determine the shape of auditory filters.

Method We now re-measure the threshold of hearing, but in this case we present to sinusoids to the listeners. • One is fixed in frequency (1 kHz) and level (60 dB SPL). • The second one is varied as before. • We measure at various frequencies the minimum sound pressure level at which the second tone is detected.

Absolute Threshold of Hearing from tonmeister.ca after Zwicker & Fastl 1999 Our absolute threshold of hearing (ATH) for a single tone now changes to …

Masked Detection Threshold shallow slope for high freqs steep curve for low freqs from tonmeister.ca after Zwicker & Fastl 1999 … to this one. We now speak of the masked detection threshold, with the sinusoids at 1 kHz, 60 dB SPL being the masker. Note that we find a steep slope just below 1 kHz, because in this range the basilar membrane is not much affected by the sound, while the slope is shallow for frequencies just above 1 kHz.

Masked Detection Threshold from tonmeister.ca after Zwicker & Fastl 1999 Of course the masked detection threshold depends on the characteristics of the masker. In this graph, several thresholds curves are shown for various masker levels (20-100 dB SPL). 100 dB SPL is a very high value. I recommend NOT to go above 85 dB SPL if you want to repeat this measurement at home.

Auditory Filters • Fletcher (1940) postulated that the auditory system behaves like a bank of pass-band filters with overlapping passband. • Helmholtz (1865) already had similar ideas. • Auditory filters can be measured in amplitude and phase as functions of frequency.

Measurement of auditory filters • We will firstly restrict ourselves to the amplitude of auditory filters. • We can use a similar paradigm to measure auditory filters as in our previous experiment: • present two sinusoids with same level and same frequency to the listener. The level was adjusted just above sensation level. • Next, we vary the frequency of the sinusoids in opposite direction. • The two sinusoids become inaudible at the point where they do not fall into one auditory filter anymore. In this case, the energy within each of the two auditory filters becomes to small to be detected.

Critical Bands: Zwicker (1961) (Fig.:Terhardt 1998) • Zwicker (1961) measured the critical bandwidth using two narrow-band maskers which masked a sine target at the center of the critical band. • Zwicker recorded the detection threshold of the sine tone (varied in level) as a function of the frequency gap between both maskers. • Unfortunately, the interference between the lower frequency noise masker and an the sine target lead to interference effects, and combination tones at different frequencies become audible, while the signal remains undetected. • This leads to the abrupt decrease in the detection threshold at 0.3 kHz.

Zwicker’s critical bands Linear range contradicts new findings

Zwicker’s critical band rate Errors between measured and predicted values (from equations below) The graph shows the Critical band rate in Bark as a function of Frequency f. The equations were established to fit the data. Previous slide

Zwicker’s critical bandwidth Errors between measured and predicted values (from equations below) The graph shows the Critical band width in Bark as a function of Frequency f. The equation was established to fit the data. Previous slide

Patterson ‘74: Measurement method from Patterson (1974) Hypothetical MASKING Shaded area: part of noise that is effectively masing the test tone Patterson (1974) used a broadband noise masker to avoid harmonicity to influence the results.

Patterson ‘74: Measurement method

Patterson 74: Results

Off-Frequency Listening on-frequency listening masker Auditory filter tone off-frequency listening masker Auditory filter tone By placing the center frequency of of the Auditory Filter above the test-tone frequency, the signal-to-noise ratio between the tone and the masker can be increased. This way the test tone is easier targeted.

Avoiding off-Frequency Listening Auditory filter on-frequency listening masker 2. masker tone Df Df Auditory filter 2. masker off-frequency listening masker tone In this two masker case off-frequency listening does not pay off anymore. By shifting the auditory filter, the influence of one masker is reduced, while the influence of the 2. masker is increased. Overall the signal-to-noise ratio balance decreases. In this experiment, it is assumed that the auditory filter is symmetrical, which is a good-enough approximation.

Noise Gap Masking (Fig.: Moore 2004) To avoid off-frequency listening, Patterson (1976) measured the threshold of the sinusoidal signal as a function of the width of the spectral notch in the noise masker. The shaded areas shows the amount of noise passing through the auditory filter.

Auditory filter shape (Fig.: Moore 2004) Typical shape of an auditory filter as measured by Patterson (1976). The center frequency is 1 kHz.

Auditory Filter non-linearity 80 dB 30 dB This graph shows the non-linearity of the auditory filter. In the left graph the filter curves Are normalized to 0 dB. The filters were measured for several 2-kHz sine tones from 30 to 80 dBs. Note how the filter broadens toward low frequencies with increasing level. In the right graph the filters were not normalized. (Fig.: Moore 2004)

Auditory Filter Bandwidths (Fig.: Moore 2004) Width of auditory filters measured with different techniques. The dashed curve shows the values of Zwicker (1961), the solid line the ERBN values which was measured using Patterson’s (1976) notched-noise method. Note the large deviations of Zwicker’s results at low frequencies, which are based on indirect measures.

ERB calculations ERBN in Hz, f=frequency in kHz ERBN # in Hz, f=frequency in kHz Glasberg and Moore (1990)

Psychophysical Tuning Curves Fig.: Moore 2004 The psychophysical tuning curves (PTC) were determined by measuring the masked detection thresholds for 6 sine tones which were presented 10 dB above sensation level (black circles). The masker was a sine tone as well which was varied in level (Data from Vogten, 1974).

Amplitude Modulation Amplitude Modulation (AM) m=modulation index (m=0 no modulation, m=1 100% modulation), fc=frequency in Hz fc fc−g fc+g Sidebands f

Frequency Modulation (FM) Frequency Modulation (FM) b=modulation index (b=0 no modulation, b=1 100% modulation), fc=carrier frequency in Hz, g=modulation frequency in Hz. For low modulation indexes, we can simulate the FM signal using quasi-frequency modulation (QFM), which consists of three sinusoids with appropriate amplitudes and phases: Quasi-Frequency Modulation (QFM) with fc1= fc2−g, fc1= fc3+g AM and QFM differ only in phase, but not in amplitude. This feature makes the stimuli interesting for psychoacoustic experiments. If the listeners do not respond differently to both stimuli, the underlying processes are most likely not dependent on phase.

Amplitude vs. Frequency Modulation (Fig.: Moore 2004) Amplitude Modulation (AM) Frequency Modulation (FM)

Perception of Modulation • For low modulation frequencies (e.g., g=5 Hz): • Amplitude modulation is perceived as loudness fluctuation • Frequency modulation is perceived as frequency fluctuation

Critical Modulation Frequency (CMF) AM and QFM modulationthresholds for a 1-kHzsinusoidal tone as a function of the modulation frequency g. In both cases the threshold decreases with modulation frequency. The lower graph shows the ratio b/m. At 90 Hz, the so-called critical modulation frequency (CMF) the ratio becomes one. Above this frequency, which highly correlates with the width of the auditory filter, the auditory system becomes insensitive towards the phase of the components: ->The phase of different frequency components plays only a role if they are processed by the same frequency band! (Fig.: Moore 2004)

Audibility of single partials of a complex tone (Fig.: Moore 2004, after data of Plomp, 1964a; Plomp and Mimpen, 1968). The x’s and open circles show the minimal separation frequency as a function of partial frequency above which the partial can be heard out with 75% accuracy. The long-dashed curve shows the ERBN function ×1.25. Basically, partials cannot be heard out, if its share the same auditory band with other partials.

Masking Patterns (Fig.: Moore 2004, data from Egan and Hake, 1950) Masking patterns (audiograms) for a narrow band of noise centered 410 Hz. The curves show the increase in threshold for a sinusoidal signal as a function of frequency. The number above each curve gives the SPL of the noise masker.

Excitation Patterns (Fig.: Moore 2004) Estimation of the excitation pattern from auditory filterbank data for a 1-kHz sinusoid. For each filter band the filter amplitude at the frequency of the test tone is determined (points a-e). Afterwards, these points are plotted at the center frequency of the corresponding filter, which represents the excitation pattern.

Excitation patterns The figure shows the excitation patterns for the 1-kHz sinusoid for various sound pressure levels from 20 to 90 dB in steps of 10 dB. (Fig.: Moore 2004)

Co-Modulation Masking Release 1. 1. target 1. The masker increases the detection threshold of the target masker f 2. The threshold is not affected if a second masker is presented far enough in frequency 2. f 3. However, if we co-modulate both maskers, the detection threshold is lowered (co-modulation masking release) 3. co-modulated masker f

Co-Modulation Masking Release (Fig.: Hall et al. 1984) Masker components NOT co-modulated Masker components co-modulated Test tone: 1-kHz sinusoid Masker: Band-pass filtered noise

PET System Peripheral Ear Transfer Functions at the basilar membrane (different auditory filters or positions at the basilar membrane). (Fig.:Terhardt 1998)

Filter Response of CTF Filter Magnitude Phase of Cochlear Transfer Functions (Fig.:Terhardt 1998)

Temporary masking

Noise vs. tonal masker 410 Hz, 90 Hz bandwidth 400 Hz Egan and Hake (1950)

Applied Psychoacoustics Lecture 3: Masking