380 likes | 514 Views
Vision in Man and Machine . STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. www.stat.ucla/~yuille. The Purpose of Vision. “To Know What is Where by Looking”. Aristotle. (384-322 BC).
E N D
Vision in Man and Machine.STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. www.stat.ucla/~yuille
The Purpose of Vision. • “To Know What is Where by Looking”. Aristotle. (384-322 BC). • Information Processing: receive a signal by light rays and decode its information. • Vision appears deceptively simple, but there is more to Vision than meets the Eye.
Visual Illusions • The perception of brightness of a surface, • or the length of a line, • depends on context. • Not on basic measurements like: • the no. of photons that reach the eye • or the length of line in the image..
Perception as Inference • Helmholtz. 1821-1894. • “Perception as Unconscious Inference”.
How Hard is Vision? • The Human Brain devotes an enormous amount of resources to vision. • (I) Optic nerve is the biggest nerve in the body. • (II) Roughly half of the neurons in the cortex are involved in vision (van Essen). • If intelligence is proportional to neural activity, then vision requires more intelligence than mathematics or chess.
Vision and Artificial Intelligence • The hardness of vision became clearer when the Artificial Intelligence community tried to design computer programs to do vision. ’60s. • AI workers thought that vision was “low- level” and easy. • Prof. Marvin Minsky (pioneer of AI) asked a student to solve vision as a summer project.
Chess and Face Detection • Artificial Intelligence Community preferred Chess to Vision. • By the mid-90’s Chess programs could beat the world champion Kasparov. • But computers could not find faces in images.
Man and Machine. • David Marr (1945-1980) • Three Levels of explanation: 1. Computation Level/Information Processing 2. Algorithmic Level 3. Hardware: Neurons versus silicon chips. Claim: Man and Machine are similar at Level 1.
Vision as Probabilistic Inference • Represent the World by S. • Represent the Image by I. • Goal: decode I and infer S. • Model image formation by likelihood function, generative model, P(I|S) • Model our knowledge of the world by a prior P(S).
Bayes Theorem • Then Bayes’ Theorem states we show infer the world S from I by • P(S|I) = P(I|S)P(S)/P(I). • Rev. T. Bayes. 1702-1761
Bayes to Infer S from I • P(I|S) likelihood function . P(S) prior. .
Technically very interdisciplinary • But applying Bayes is not straightforward. • A beautiful theory is being developed adapting techniques from Computer Science, Engineering, Mathematics, Physics, and Statistics. • E.G. Probabilistic Reasoning (Pearl CS), Level Sets (Osher Maths).
Examples • Generative Models • Visual Inference: (1) Estimating Shape. (2) Segmenting Images. (3) Detecting Faces. (4) Detecting and Reading Text.
Generative Models Learn Generative Models from a few images and then generate new images.
Uses of Generative Models Univ. Oxford
Shape and Photometry ( Soatto Lab) • Estimate geometry (shape) and photometry from multiple images. Jin-Soatto-Yezzi
Compare ground truth (Soatto Lab) Estimated shape Alternative algorithm Ground truth Jin-Soatto-Yezzi 11/1/02
Compare w. ground truth (Soatto Lab) Generated Image: synthesized from novel viewpoint and illumination. Ground Truth: same lighting and viewpoint Jin-Soatto-Yezzi 11/1/02
Segmenting Images (Zhu Lab) • Characterize the set of image patterns that occur in natural images. Provide mathematical models. P(I|S) and P(S).
Back to the Brain • Top-Level; compare human performance to Ideal Observers. • Explain human perceptual biases (visual illusions) as strategies that are “statistical effective”.
Brain Architecture • The Bayesian models have interesting analogies to the brain. • Generative Models require top-down processing
Conclusion • Vision is unconscious inference. • Theory of Vision for Man and Machine. • See more about Vision at UCLA in the Vision and Image Science Collective • http://visciences.ucla.edu