760 likes | 859 Views
1. Cross-Modal (Visual-Auditory) Denoising. Dana Segev Yoav Y. Schechner Michael Elad. Technion – Israel Institute of Technology. Motivation. Noisy digits sequence. Digits sequence. Denoised by state of the art algorithm of Cohen & Berdugo. Segev, Schechner, Elad, Cross-Modal Denoising.
E N D
1 Cross-Modal (Visual-Auditory) Denoising Dana Segev Yoav Y. Schechner Michael Elad Technion – Israel Institute of Technology
Motivation Noisy digits sequence Digits sequence Denoised by state of the art algorithm of Cohen & Berdugo Segev, Schechner, Elad, Cross-Modal Denoising
Motivation • Use one modality to denoise another? • Use video to denoise • a soundtrack? Segev, Schechner, Elad, Cross-Modal Denoising
a Noise • Very intense • Non-stationary • Unknown • Unseen source. Single microphone Segev, Schechner, Elad, Cross-Modal Denoising
denoised audio Cross-modal Example-Based very noisy audio Input time (sec) video Algorithm Output For human and machine hearing Segev, Schechner, Elad, Cross-Modal Denoising
Intuition Segev, Schechner, Elad, Cross-Modal Denoising
Intuition Segev, Schechner, Elad, Cross-Modal Denoising
Intuition I E Training xample set nput test set Segev, Schechner, Elad, Cross-Modal Denoising
Speech Examples Extraction Segev, Schechner, Elad, Cross-Modal Denoising
Speech Examples Extraction ~syllable (0.25 sec) Segev, Schechner, Elad, Cross-Modal Denoising
Music Segments Extraction lophone Xylophone Segev, Schechner, Elad, Cross-Modal Denoising
Music Segments Extraction lophone Xylophone Sound Segev, Schechner, Elad, Cross-Modal Denoising
Principle ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Principle ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Audio Only ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Audio Only ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Denoising • Cross-modal representation. • Generating multimodal features. • Learning feature statistics. • Cross-modal pattern recognition. • Rendering a denoised signal. Segev, Schechner, Elad, Cross-Modal Denoising
Feature-space Creation time (sec) Input video Video feature-space Input audio Audio feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Feature-space Creation time (sec) Audio-video feature-space Input audio-video Segev, Schechner, Elad, Cross-Modal Denoising
Feature-space Creation Audio-video examples feature-space Training audio-video time (sec) Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure Nearest Neighbor Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure Nearest Neighbor Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Rendering a denoised signal Noisy audio Clean segment Clean segment Clean segment Segev, Schechner, Elad, Cross-Modal Denoising
Rendering a denoised signal Noisy audio Clean segment Clean segment Clean segment Denoised Segev, Schechner, Elad, Cross-Modal Denoising
Distance-measure ... ... Examples Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... ... ... ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... ... ... ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Bartender experiment Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... ... ... ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Denoising • Cross-modal representation. • Generating multimodal features. • Learning feature statistics. • Cross-modal pattern recognition (NN). • Rendering a denoised signal. Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Feature-space Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Feature-space For the k-th example segment: Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior bi - fif - ty- two Feature-space For the k-th example segment: bi ty ar fif two Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Next cluster bi ty fif two ar 1 bi 1 1 1 ty 1 fif 1 Feature-space 1 2 1 two bi 1 ar Current cluster ty ar fif two Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Syllable consecutive probability Next cluster bi ty fif two ar 53 23 bi 26 5 1 12 60 43 17 6 ty 22 4 1 fif 5 3 6 2 13 12 21 two 9 7 2 7 11 ar = Current cluster Number of examples in training set The probability for transition between clusters Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Hidden Markov Model fif fif Time delay two two bi ty ty bi P Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Audio noise fif fif Time delay two two bi ty ty bi P Segev, Schechner, Elad, Cross-Modal Denoising
Feature Statistics as a Prior Hidden Markov Model Audio noise fif fif + Time delay two two bi ty ty bi P Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... ... ... ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Examples ... ... ... ... ... ... ... ... ... ... Input Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Input video Segev, Schechner, Elad, Cross-Modal Denoising
Cross-Modal Association Input video Segev, Schechner, Elad, Cross-Modal Denoising