1 / 14

Computational Auditory Scene Analysis

Computational Auditory Scene Analysis. Kevin D. Donohue Databeam Professor Electrical and Computer Engineering University of Kentucky. Describe What You Hear. Scene 1. Scene 2. Scene 3. Sounds downloaded from http://www.prankcallsunlimited.com/. Auditory Scene Analysis.

Download Presentation

Computational Auditory Scene Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Computational Auditory Scene Analysis Kevin D. Donohue Databeam Professor Electrical and Computer EngineeringUniversity of Kentucky

  2. Describe What You Hear Scene 1 Scene 2 Scene 3 Sounds downloaded from http://www.prankcallsunlimited.com/

  3. Auditory Scene Analysis Auditory Scene Analysis (ASA) is a cognitive process that organizes sounds into perceptual objects. • Computational Auditory Scene Analysis (CASA) uses computational models to study ASA.

  4. Auditory Scene: Input • Sensory organs (ears) separate acoustic energy into frequency bands and convert band energy into neural firings • The auditory cortex receives the neural responses and abstracts an auditory scene. http://hyperphysics.phy-astr.gsu.edu/hbase/sound/hearcon.html Time Frequency

  5. High-Level Cognition Acoustic to Neural Conversion Organize into Auditory Streams Representation of Reality Auditory Scene: Perception • Perception derives a useful representation of reality from sensory input. • Auditory Stream refers to a perceptual unit associated with a single happening (A.S. Bregman, 1990) . Schema-driven/Top-downProcesses Primitive/Bottom-upProcesses

  6. Auditory Stream Experiment Bergman & Campbell (1971) • Streams tend to form by grouping notes close in time and frequency (similarity and proximity). http://www.psych.mcgill.ca/labs/auditory/demo3.html http://www.psych.mcgill.ca/labs/auditory/demo2.html

  7. Circularity in Pitch Judgement • Shepard’s Scale (1964) (Auditory Demonstrations CD, from the Acoustical Society of America)

  8. Perceptual Organization Organization properties: • Belongingness – a sensory element belongs to an organization (or stream) of which is a part. • Exclusive allocation – a sensory element cannot belong to more than one organization at a time. • Bregman & Rudnicky (1975)

  9. Perceptual Organization Organization properties: • Closure – perceived continuity, a tendency to close strong perceptual forms, response to missing evidence.

  10. Sequential and Spectral Integration in Forming Streams • Sequential Integration • Grouping sensory elements over time or events at different times considered to be from the same source/object. • Spectral Integration • Fusing simultaneous sensory elements over frequency into one.

  11. Timbre and Spectral Integration • The time envelope and harmonic structure give rise the timbre of the sound.

  12. Timbre and Spectral Integration • Simultaneous tones grouped by timbre Same Note (A) 2 Notes (F and A)

  13. Auditory Scene Organization • Primitive Stream Segregation • Inherent constraints in auditory scene analysis (perceptual organization demonstrated by infants/children) • Schema-based segregation • Learned constraints in auditory scene analysis (differences in perceptual organization resulting from training and culture) (A.S. Bregman, Auditory Scene Analysis, MIT Press 1990, pp. 1-45)

  14. Cues Use for Grouping From THE AUDITORY ORGANIZATION OF SPEECHAND OTHER SOURCES IN LISTENERS AND COMPUTATIONAL MODELS, M. Cooke and D. P.W. Ellis, 1999

More Related