Communicating in a Natural Cocktail Party: Relating Human and Animal Behavior to Neural Response

Communicating ina Natural Cocktail Party:Relating Human and AnimalBehavior to Neural Response Barbara Shinn-Cunningham Boston University Auditory Neuroscience Laboratory

Michele Dent Liz McClaine Kamal Sen Rajiv Narayan Erol Ozmeral Erick Gallun Gin Best With funding from ONR, AFOSR, & NIH

The everyday acoustic environment is full of competition and clutter • The “Cocktail Party Problem” (Cocktail Party by SLAW, Maniscalco Gallery)

Penguins and other social birds suffer from the cocktail party problem Penguins recognize their mates and offspring amidst thousands of birds Chicks identify parents from 11 m -- when call is 6 dB below the level of the background noise (thanks to M. Dent)

Zebra finches learn to make a call by listening to a tutor… while in a large colony • Zebra finches are a model system for studying • vocal production learning • hierarchical encoding of complex signals (e.g., “birds own song” neurons; Narayan et al., 2006)

How do we figure out what is in the world from the sound mixtures we hear? • time (sec) • air pressure

How do we figure out what is in the world from the sound mixtures we hear? • time (sec) • Syllables / words heard as units • Confusions occur between sources (streaming over time) • air pressure

Frequency analysis breaks sound into parallel channels • time (sec) • mechanical vibration in air • neural firing (electrical spikes) in auditory nerve • low • When sounds overlap in their spectral content, neural responses are a mixture • air pressure • high

Spectrotemporal structure of sound is critical(in contrast with simple, “traditional” stimuli) • Supports segregation of competing source “units” • harmonicity • common onsets • comodulation • Reduces likelihood of spectrotemporal overlap (important elements are unlikely to be masked) • Moreover, theimportant information is contained in the spectrotemporal structure of sound. • BUT… • removing linguistic/semantic effects may tease out different contributing mechanisms

This project uses birdsongs from male zebra finch • Can compare results with avian behavior (Dent lab) and neurophysiological responses (Sen lab) • Spectrotemporal structure supports segregation

Listeners were trained to identify individual bird songs Moe Uno Toro Nibbles Junior

A quick test for you… • What is this? • What is this? Moe Uno Toro Nibbles Junior

Short-term “units” segregate, but streaming errors occur for similar sources • What is this? • + Moe Nibbles • What is this? Moe Uno • + Uno noise Toro Nibbles Junior

Three maskers, to tease apart different types of interference

Main type of interference Effect of spatial separation Reduce audibility Improve audibility through acoustic better-ear effects ? ? Cause streaming confusions Allow spatial attention to combat confusions Why these maskers? Spectrotemporalstructure DenseMuch overlap Noise More sparseTemporal control for chorus Mod. Noise SparseKey target features audible Chorus

separated • co-located Better-ear effects: the Target-to-Masker Energy Ratio improves with separation • separated • co-located

Binaural effects: Interaural decorrelation causes masked signal to be audible • Running cross-correlation output for 500-Hz channel • (simple model of brainstem processing in Medial Superior Olive)

Binaural effects: Interaural decorrelation causes masked signal to be audible • Running cross-correlation output for 500-Hz channel • (simple model of brainstem processing in Medial Superior Olive) • Important for signals below about 1500 Hz, but the birdsongs have a lot of high-frequency information

Hypothesized role of spatial attention in complex settings ?

Measure performance with and without spatial separation of target / masker Masker Masker

M M Quantify spatial unmasking = improvement in threshold Co-located Separated

M Spatial Unmasking M M - M Quantify spatial unmasking = improvement in threshold Co-located Separated

M M For birdsong, better-ear energy effects are large No better ear Better ear

M Is there an additional benefit of perceived separation? Diotic versus binaural Diotic

M Is there an additional benefit of perceived separation? Diotic versus binaural Diotic • Better-ear benefit • No binaural processing - Sources perceived at same location

Binaural • Better-ear benefit • Maybe binaural processing - Sources perceived at different locations M M Is there an additional benefit of perceived separation? Diotic versus binaural Diotic • Better-ear benefit • No binaural processing - Sources perceived at same location

Main type of interference Effect of spatial separation Diotic vs. binaural performance? Reduce audibility Improve audibility through acoustic better-ear effects Identical ? ? ? Cause streaming confusions Allow spatial attention to combat confusions Binaural much better than diotic performance Why these maskers? Noise Mod. Noise Chorus

For co-located target/masker, the chorus causedthe most interference worse performance (need louder target) Target threshold (dB re: masker target)

Spatial separation causes unmasking due to better-ear acoustics (diotic presentation) improvement with spatial separation Identify Target Best et al. 2005

Size of acoustic effect decreases as masker becomes sparser (audibility less of a problem) Spatial separation causes unmasking due to better-ear acoustics (diotic presentation) improvement with spatial separation Identify Target Best et al. 2005

No advantage from spatial attention or binaural processing (high-frequency content) For dissimilar maskers,there is no added benefit of perceived spatial separation improvement with spatial separation Identify Target Best et al. 2005

Perceived separation adds 10 dB of spatial unmasking, for confusable masker No advantage from spatial attention or binaural processing (high-frequency content) For a chorus masker, perceived location differences improve identification improvement with spatial separation Identify Target Best et al. 2005

… … … … … • LEDs on the speakers: • no information • which speaker • which time • or both Ask listener to identify a song from a random location, occurring at a random time Five simultaneous, similar sources, every 15 deg

For best subjects, when cue less important; they report “pop out” of familiar songs For identification of familiar birdsongs in a chorus,when and where bothhelp

For identifying digits intime-reversed digits,when doesn’t help For all subjects, when cue less important; forward digits “pop out” of reversed speech

Ongoing work How prior knowledge affects spatial attention The role of visual cuing of spatial attention Divided auditory attention Comparisons with visual attention Modeling spatial release from different interference

Psychophysics shows different maskers cause different forms of perceptual interference • Noise and modulated noise • reduce audibility of song elements • are dissimilar from targets • are easily segregated • don’t cause confusion • show spatial release due to acoustic better-ear improvements • Chorus (or reversed speech) • is sparse enough that overall interference is not as great • consists of “units” (syllables) like those in the targets • is hard to segregate from target • causes confusion between target and masker • shows spatial release through spatial attention

Comparing to avian behavior Dent Lab SUNY Buffalo

Dent Lab: Teaching birds to recognize the songs PECK left key to begin variable waiting period (2-7 s) HEAR a call from one of six individuals RECOGNIZE and CATEGORIZE call PECK left or right key Correct: food reward Incorrect: lights extinguished

Zebra finch and budgerigars learn zebra finch songs Percent Correct *Average sessions to criterion = 34.25 = 22

Relative effectiveness of the maskers differs across species worse performance (need louder target) Target threshold (dB re: masker target)

Next stage: measuring whether the birds use spatial attention like humans Budgerigars exhibit spatial release for noise maskers Binaural Hearing S + N 35 Unilateral Sound Source Bilateral Sound Source 30 25 20 Masked Threshold (in dB) 15 10 S + N N 5 0 1.00 2.00 2.86 4.00 Frequency (in kHz) *Dent et al., Behav. Neurosci., 1997

Avian psychophysics shows different maskers cause different levels of interference • Degree of interference differs from species to species • Next stage will explore whether effect of spatial separation differs with masker type, as in humans

Comparing to avian physiology Sen Lab Biomedical Engineering

Recording from zebra finch forebrain Field L (homologue of primary auditory cortex) Record neural spike trains in response to multiple copies of clean songs from five birds Record neural spike trains in response to repetitions of each song embedded in each masker

Each neuron has a set of spectrotemporal features to which it responds Broadband onset neuron Narrowband neuron Frequency Time

Compare clean-song templates to target + masker

Communicating in a Natural Cocktail Party: Relating Human and Animal Behavior to Neural Response

Communicating in a Natural Cocktail Party: Relating Human and Animal Behavior to Neural Response

Presentation Transcript

ANIMAL BEHAVIOR

Introduction to Animal Behavior

cocktail party catering

Animal Behavior

Animal Behavior

ANIMAL BEHAVIOR Behavior…anything an animal does in response to a stimulus

Cocktail Party Processing

Relating Animal Model Phenotypes to Human Disease Genes Project Goals:

Welcome to my cocktail party!

COCKTAIL PARTY

Introduction to Animal Behavior

Animal Behavior

UNLEASH YOUR PARTY ANIMAL IN THE BEST COCKTAIL PARTY PLACES IN POWAI BookEventZ

Cocktail Party Venue in Brisbane

NYC caterers Cocktail Party

Learn Cocktail Tricks In Cocktail Classes Hen Party

Animal Communicating

Buy A Party Cocktail Dress

Hire Cocktail Mixologists _ Cocktail Party Package

Cocktail Making Hen Party

Relating Animal Model Phenotypes to Human Disease Genes Project Goals:

The Cocktail Party Effect