650 likes | 659 Views
Explore the critical bands in masking, power spectrum model, loudness scaling, Stevens’ Power Law, and measurement techniques in auditory perception. Discover the complexities of masking and its impact on perceived loudness.
E N D
Auditory Perception Rob van der Willigen http://~robvdw/cnpa04/coll1/AudPerc_2007_P7.ppt
Today’s goal Understanding masking: Critical bands in masking Power spectrum model of masking Measurement of masking
Psychoacoustics SPL is not a measure of Perceived Loudness • Defined as the attribute of auditory sensation in terms of which sounds can be ordered on a scale extending from quiet to loud. • Two sounds with the same sound pressure level may not have the same (perceived)loudness • A difference of 6 dB between two sounds does not equal a 2x increase in loudness • Loudness of a broad-band sound is usually greater than that of a narrow-band sound with the same (physical) power (energy content) Recapitulation last weeks’ lecture
Psychoacoustics Perceived Loudness: phone • A unit of LOUDNESS LEVEL (L) of a given sound or noise. • Derived from indirect loudness measurements • If SPL at reference frequency of 1kHz is X dB • the corresponding equal loudness contour is the X phon line. • Phon units can’t be added, subtracted, • divided or multiplied. • 60 phons is not 3 times louder than 20 phons! • The sensitivity to different frequencies is more • pronounced at lower sound levels than at higher. • For example: a 50 Hz tone must be 15 dB higher • than a 1 kHz tone at a level of 70 dB Recapitulation last weeks’ lecture
Psychoacoustics Loudness Scaling: Magnitude of perceptual change Fechner predicted that a JND for a faint background produces the same difference in sensation as does the JND for a loud stimulus. Thus, a scale of S (Loudness) should be derivable by counting intensity jnds Measure of loudness: sensation intensity (S) in JND units Recapitulation last weeks’ lecture
Psychoacoustics Loudness Scaling: Magnitude of perceptual change • Consequences of a logarithmic Loudness function: • Changes from 15 to 30 dB should be the same as the change from 30 to 60 dB. • If loudness additivity holds, two tones at 70 dB should sound as loud as one tone at 140 dB • What if the jnd does not represent a constant change in loudness? • How could this be? • The jnd is determined by two things: • 1) Perceptual distance (change in loudness) • 2) Internal noise Fechner assumed (incorrectly) that internal noise is constant. Measure of loudness: sensation intensity (S) in JND units Recapitulation last weeks’ lecture
Psychoacoustics Loudness Scaling: Stevens’ Power law Another function relating Loudness S is Stevens’ power law: The exponent m describes whether sensation is an expansive or compressive function of stimulus intensity. The coefficient a simply adjusts for the size of the unit of measurement for stimulus intensity threshold above the 1-unit stimulus. =0.3 Recapitulation last weeks’ lecture
Psychoacoustics Scaling: Stevens’ Power law
Psychoacoustics Loudness Scaling: sone vs. phon SONE: a unit to describe the comparative loudness between two or more sounds. One SONE has been fixed at 40 phons at any frequency (40 phon curve). 2 sones describes sound two times LOUDER than 1 sone sound. A difference of 10 phons is sufficient to produce the impression of doubling loudness, so 2 sones are 50 phons. 4 sones are twice as loud again, viz. 60 phons. p is the base pressure of a sinusoidal stimulus, po is its absolute threshold.
Psychoacoustics Loudness Scaling Depends on: Number of excited hair cells (hence bandwidth of sound) Excitation of each cell (energy in each auditory filter)
Psychoacoustics Measuring Sound: Frequency Domain
The Intensity Density Level of three types of NOISES: Psychoacoustics Physical parameters of sound waves: Power Spectrum Density WHITHE NOISE BROWN (RED) NOISE GRAY NOISE Intensity density level [dB] Log Frequency [Hz]
Low Pass High Pass Frequency Frequency Band Pass Band Reject Frequency Frequency Psychoacoustics Measuring Sound: Filter Characteristics
Acoustic Filtering of the Auditory system: A-weighting The shapes of equal-loudness contours have been used to design sound level meters (audiometer). At low sound levels, low-frequency components contribute little to the total loudness of a complex sound. Thus an A weighting is used to reduces the contribution of low- frequencies.
Acoustic Filtering of the Auditory system: Audiograms of non-humans also shows weighting
Psychoacoustics Measuring Sound: Filter boundaries
Psychoacoustics What is Masking? • “The process by which the threshold of audibility for one sound is raised by the presence of another (masking) sound.” • (American Standards Association, 1960) • How can masking occur? • 1) Excitation: Swamping of neural activity due to masker. • 2) Suppression: Reduction of response to target due to presence of masker.
Psychoacoustics What is Masking? Simultaneous / Time Shifted The presence of one sound masks (hides) the presence of another A loud sound will mask a quieter sound (even if presented before (forward masking) or after (backward masking) the quieter sound) e.g. Given a masking tone of 400 Hz 70dB SIL, a 600 Hz has to be >100 dB SIL above its minimal threshold level (i.e., threshold in quiet) in order to become audible in presence of this 400 Hz masker tone.
Psychoacoustics Temporal aspects of Masking (1) Post-stimulus/Forward/Post-masking: 1st Masker 2nd test tone (2) Pre-Stimulus/Backward/Pre-masking: 1st test tone 2nd Masker (3) Simultaneous Masking: Test tone and Masker together
Psychoacoustics Two Definitions of Masking (1)The process by which the threshold of audibility for one sound is raised by the presence of another (masking) sound: • Masking is the reduction in audibility of one sound caused by the presence of another sound. (2)The amount by which the threshold is raised by the masker (in dB): • The amount of masking is the difference between the threshold for the target sound with no masker and the threshold for the target sound with the masker.
Psychoacoustics Critical bands in Masking Fletcher (1940) conducted an simultaneous masking experiment in which there was band-pass noise and a single sine wave. The frequency of the sine wave was always at the center frequency of the noise, and the power density of the noise was fixed. The bandwidth of the noise was varied, and for each bandwidth the minimum intensity at which the sine wave could be perceived was determined. With increasing bandwidth, the total energy of the noise increased.
Psychoacoustics Critical bands in Masking Handbook of Psychology By Irving B. Weiner, Donald K. Freedheim, John A. Schinka, Wayne F. Velicer, Alan M. Goldstein http://books.google.com/books?id=fErelr18MEUC&pg=PA87&lpg=PA87&dq=%22Fletcher+(1940)+%22++masking+experiment&source=web&ots=vz3C3Mzhgb&sig=EgANuNFgxcVLWlmnj9oWQYIDD9I#PPA88,M1
Psychoacoustics Critical bands in Masking
Psychoacoustics Critical bands in Masking A sine (signal) in the presence of noise that has a band width (in frequency) centered around the signal. critical band The wider the noise bandwidth the more the signal (sine wave) is masked. Past a particular (frequency) band-width beyond which the threshold doesn’t increase.
Critical band 150 Hz 300 Hz 400 Hz Auditory filter bandwidth 450 Hz 600 Hz Physical bandwidth SPL (dB) Frequency(Hz) 2000 Hz Psychoacoustics Critical band: ERB The transition point of the auditory filter is known as the Critical Band. This has also been termed the Equivalent Rectangular Bandwidth (ERB). • The critical band is the • point at which thresholds • no longer increase. • Conceptually very • powerful, but not much • use in providing an • accurate estimate of filter bandwidth. • Not possible to discern • filter shape from results.
Critical band = ERB 400 Hz Auditory filter bandwidth SPL (dB) Frequency(Hz) 2000 Hz Psychoacoustics Critical band versus Critical Ratio
Critical band = ERB 400 Hz Auditory filter bandwidth SPL (dB) Frequency(Hz) 2000 Hz Psychoacoustics Critical band versus Critical Ratio • The critical ratio is the • S/N ratio required to detect a pure tone at masked threshold • Conceptually not very • powerful, but much • Easier to estimate than the critical bandwidth. • Not possible to discern • filter shape from results.
Psychoacoustics Tonal Probe Masking? • When a tone is masked by a broadband noise, • masked threshold: • Increases slightly at higher probe frequencies • Decreases dramatically at higher probe frequencies • Is exactly the same at all probe frequencies • Can be lower than absolute threshold !!!!
Psychoacoustics Masking Curves versus ISO-L curves (left Column) Probe threshold, Lp, or Masker threshold, Lm, plotted with fp, as independent variable, will be referred to as "masking curves." (right column) Curves for a fixed probe frequency and with fm as the independent variable will be referred to as "iso-Lp curves" when the masker level Lm, (at probe threshold) is plotted as a function of fm. For plots of the probe level Lp as a function of the masker frequency we will use the term "iso-Lm curves."
Psychoacoustics ISO-L Curves Curves for a fixed probe frequency with fm as the independent variable will be referred to as "iso-Lp curves" when the masker level Lm, (at probe threshold) is plotted as a function of fm. As usual, we start with a tone for a probe, and we’ll use a broadband noise as a masker. These curves show masked threshold as a function of frequency, for different masker spectrum levels, ranging from -10 to 60 dB SPL. The lowest curve is an idealized audibility curve. Notice that the masking contours are flatter than the audibility curve; frequency doesn’t make much difference in masked threshold, although it does increase some at higher frequencies. Also notice that the curves are parallel to each other: At all frequencies, if you increase the spectrum level of the noise by 10 dB, masked threshold increases by the same amount at all frequencies, 10 dB. The masking contours do not extend beyond the audibility curve, because at very low levels the masker has no effect. In other words, if you can’t hear the masker (in the frequency region of the probe), masked and unmasked threshold will be the same.
Psychoacoustics ISO-L Curves Curves for a fixed probe frequency with fm as the independent variable will be referred to as "iso-Lp curves" when the masker level Lm, (at probe threshold) is plotted as a function of fm. As usual, we start with a tone for a probe, and we’ll use a broadband noise as a masker. 10 dB increase in masker level leads to a 10 dB increase in masked threshold (amount of masking) Once the masker becomes audible, a 10 db increase in masker level leads to a 10 dB increase in masking. This is true for all tone frequencies.
Psychoacoustics MASKING CURVE Experimental procedure: The procedure for a masking experiment. (a) The threshold is determined across a range of frequencies. Each arrow indicates a frequency where the threshold is measured. (b) The threshold is re-determined at each frequency (small arrows) in the presence of a masking stimulus (large arrow)
Psychoacoustics The Masking Curve Shown is the hearing curve (red) and a single tone (sine-wave) with a frequency of 1kHz (black). The green curve is the masking curve due to that tone.Indicates the amount that the threshold is raised in the presence of a masking noise centered The band of noise in yellow at a centre frequency of about 1.5kHz cannot be perceived by the human ear because of the masking effect of the tone at 1kHz.
Psychoacoustics ISO-Lp curves (Lm versus Fm): Psychophysical Tuning Curve Experimental procedure: First, a low level test tone is presented. Then, masking tones are presented with frequencies above and below the test tone. Measures are taken to determine the level of each masking tone needed to eliminate the perception of the test tone. Assumption is that the masking tones must be causing activity at same location as test tone.
Psychoacoustics ISO-Lp curves (Lm versus Fm): Psychophysical Tuning Curve To answer this question, consider this experiment. A tone is detected in the presence of another tone (I.e., the second tone is the masker). We fix the level of the probe tone, and we vary the level of the masking tone to find the threshold for the probe. The threshold for the probe is estimated for lots of masker tone frequencies around the probe frequency. What will the plot of masker level at threshold as a function of masker frequency look like? Notice the upward spread of masking: low frequencies mask high more than high mask low.
Psychoacoustics ISO-Lp curves (Lm versus Fm): Psychophysical Tuning Curve The psychophysical tuning curve is determined by measuring the sound pressure of each masking tone that reduces the perception of the test tone to threshold. The procedure for measuring a psychophysical tuning curve. (a) A 10-dB SPL test tone (blue arrow) is presented. (b) Then a series of masking tones (red arrows) are presented at each frequency.
Psychoacoustics Psychophysical tuning curves: ISO-Lp curves (Lm versus Fm) Psychophysical Tuning Curves (PTCs): Fixed signal; masker level adjusted to just mask signal. Psychophysical tuning curves for a number of test-tone frequencies (dots). Notice how the minimum masking intensities for the curves match the shape of the audibility curve (dashed line). (Based on Vogten, 1974). Advantages: Concept v. similar to neural tuning curves, allowing direct comparisons. Potential problems: “Off-frequency listening” Detection of beats if using a sinusoidal masker.
Psychoacoustics Psychophysical tuning curves: ISO-Lp curves (Lm versus Fm) (a) Three human psychophysical tuning curves generated using the method described in right figure. The arrows show the frequency of three different test tones. You can see from the figure that when the masking tone is the same as, or close to, the test tone in frequency, the intensity of the masker needed to mask the test tone is low. (b) Three neural tuning curves showing the stimulus intensity needed to generate a constant response (firing rate) in the nerve fiber of a cat. Each curve represents a different auditory nerve fiber The procedure for measuring a psychophysical tuning curve. A10 dB test tone (black arrow) is presented and then a series of masking tones (red arrows) are presented at the same time as the test tone. The psychophysical tuning curve is generated by determining the SPL threshold of the masking tones needed to reduce the perception of the test tone to threshold Vibration patterns on the basilar membrane caused by 400, 800 and 1000 Hz tones
Psychoacoustics Psychophysical tuning curves versus Frequency Tuning Curves Frequency Tuning Curves (FTCs): measured by finding the pure tone amplitude that produces a criterion response in an 8th nerve fiber. Psychophysical Tuning Curves (PTCs): Fixed signal; masker level adjusted to just mask signal.
Psychoacoustics Summary ISO-Lp curves (Lm versus Fm) or PTC Resulting tuning curves show that the test tone is affected by a narrow range of masking tones. Psychophysical tuning curves (PTC) show the same pattern as neural tuning curves which reveals a close connection between perception and the firing of auditory fibers Advantages: Concept v. similar to neural frequency tuning curves (FTC), allowing direct comparisons. Potential problems: “Off-frequency listening” Detection of beats if using a sinusoidal masker.
Psychoacoustics TWO_TONE SUPRESSION In single auditory nerve recordings, the response to a just supra threshold tone at CF can be reduced by a second tone, even though the tone would - itself have increased the nerve's firing rate. A similar effect is found in forward masking. The forward masking of tone a on tone c can be reduced if a is accompanied by a third tone b with a different frequency, even though b has no effect on c on its own.
Psychoacoustics Iso-Lm curve (Lp versus Fp): Masked Audiogram Lp as function of Fp Masking curves (masked audiograms) for a narrow band of noise centered at 1 kHz and bandwidth of 160 Hz. (Lm is constant) Each curve shows the elevation in the threshold of sinusoidal signal as a function of signal frequency. That is: for a fixed narrowband masker, the change in threshold for a single-tone probe over a specific frequency range is determined. The overall noise level of each curve is indicated in the figure.
Psychoacoustics Shape of auditory filter: Excitation patterns The shape of auditory filters as determined from the shape of tuning curves in masking experiments
Psychoacoustics Shape of auditory filter: Power Spectrum Model The auditory filters can be approximated by rectangular filters, but better determination of the filter shape is possible. The critical bandwidth at a particular frequency can be estimated using the formula where P is the intensity of the signal, N0 is the noise power over a 1-Hz range, K is the threshold of detectability (usually 0.4), and W is the critical band width (CB). N0 is independent of frequency. For example, the CB at 1000 Hz is 160 Hz; however, in reality rectangular filters are not accurate; the shape changes with frequency and amplitude Better approximations of the auditory filters look like this: 160 Hz Auditory filter bandwidth Critical band SPL (dB) Frequency(Hz) 1000 Hz
Psychoacoustics Power Spectrum model of Masking • Fletcher's experiment led to a model of masking known as the power-spectrum model that is based on the following assumptions: • 1. The peripheral auditory system contains an array of linear overlapping band-pass filters. • The non-linearity of the filters is now well known. • 2. Listener detect signals by using just one filter with a center frequency close to that of the signal. • Listeners clearly combine information across filters • 3. Only the components of the noise which pass through the filter have any effect in masking the signal. • Energy outside the filter can play an important role (see literature on informational masking and co-modulation masking release) • 4. Detection threshold is determined by the amount of noise passing through the filter, calculated as the ratio of the long-term power spectra of signal and noise. • Fluctuations in the masker can play a strong role
Hypothetical auditory filter masker masker signal Psychoacoustics Shape of auditory filter: notched noise method
Hypothetical auditory filter masker masker signal Psychoacoustics Shape of auditory filter: notched noise method
Psychoacoustics Shape of auditory filter: notched noise method Narrow versus Broad filters: As the notch width increases, the amount of noise passing through a narrow filter drops off faster than through a broad filter. Therefore, threshold will improve more quickly for a narrow filter than for a broad filter, as the notch width increase.
Psychoacoustics Shape of auditory filter: notched noise method Narrow versus Broad filters: As the notch width increases, the amount of noise passing through a narrow filter drops off faster than through a broad filter. Therefore, threshold will improve more quickly for a narrow filter than for a broad filter, as the notch width increase.
Psychoacoustics Shape of auditory filter: notched noise method Narrow versus Broad filters: The top curve would be for a broad filter and the bottom curve for a broad filter.