1 / 36

frequency

C. Shannon: Communication in Presence of Noise combination of channel and signal spectrum should be as flat (as random-like) as possible. energy of the signal. level of noise in the channel. frequency. energy of the signal. energy of the signal. level of noise in the channel.

tadhg
Download Presentation

frequency

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. C. Shannon: Communication in Presence of Noise • combination of channel and signal spectrum should be as flat (as random-like) as possible energy of the signal level of noise in the channel frequency

  2. energy of the signal energy of the signal level of noise in the channel level of noise in the channel resource space resource space Forces of Nature if the receiver could be controlled • put more resources (introduce less noise) where there is more signal • biological system optimized for information extraction from sensory signals • if signal could be controlled (e.g. in communication) • put more signal where there is less noise • sensory signal optimized for a given communication channel

  3. Roman Jakobson • Born in Moscow in 1896, • Co-founder Moscow Linguistic Circle (1915) • Prague Linguistic Circle (1926). • Free French University New York (1942-1946) • Professor at Columbia, Harvard, and M.I.T. • 1982 We speak in order to hear in order to be understood

  4. Radio Rex (1917) Newton l/4 beer Where is the message? /u/ /o/ /a/ /e/ /iy/ • “limited commercial success” • -John Pierce 1969

  5. frequency time time

  6. frequency time

  7. time frequency get spectral components time Short-term Spectrum 10-20 ms /j/ /u/ /ar/ /j/ /o/ /j/ /o/

  8. Spectrogram – 2D representation of sound

  9. Short-term Fourier analysis

  10. critical bandwidth Spectral resolution of hearing spectral resolution of hearing decreases with frequency (critical bands of hearing, perception of pitch,…) 10000 5000 2000 1000 critical bandwidth [Hz] 500 100 threshold of perception of the tone 50 50 100 500 1000 5000 10000 frequency [Hz] what happens outside the critical band does not affect decoding of the sound in the critical band noise bandwidth

  11. frequency energies in “critical bands”

  12. Sensitivity of hearing depends on frequency

  13. loudness = intensity 0.33 intensity (power spectrum) intensity ≈ signal 2 [w/m2] loudness [Sones] |.|0.33 loudness

  14. Not all spectral details are important a) compute Fourier transform of the auditory spectrum and truncate it (cepstrum) b) approximate the auditory spectrum by an autoregressive model 6th order AR model 14th order AR model power (loudness) power (loudness) frequency (tonality) frequency (tonality)

  15. It’s about time (to talk about TIME)

  16. ~200 ms

  17. filter

  18. Several hundreds of miliseconds long buffer appears to be consistent with perception • data in the buffer interact • data outside the buffer do not interact with the data in the buffer temporal buffer => filter

  19. time trajectories of the spectrum

  20. Frequency response (peak around 5 Hz) Impulse response (effective length around 200 ms)

  21. Frequency response (peak around 5 Hz) Impulse response (effective length around 200 ms)

  22. spectrogram (short-term Fourier spectrum) time [s] Perceptual Linear Prediction (PLP) (12th order model) RASTA-PLP

  23. Machine recognition of speech trained on data from New Jersey Labs

  24. filter

  25. filter spectrogram spectrum from RASTA-PLP

More Related