210 likes | 220 Views
Outline. Theoretical approaches to computer vision Visual perception as information processing Problems in Computer Vision Classification Segmentation Recognition Motion analysis. Visual Perception as an Inverse Problem.
E N D
Outline • Theoretical approaches to computer vision • Visual perception as information processing • Problems in Computer Vision • Classification • Segmentation • Recognition • Motion analysis
Visual Perception as an Inverse Problem • Retinal images are generated by the light reflected from the 3-D world • The image formation is determined by the laws of optics • The area of image rendering is called computer graphics • Vision as an inverse problem • Get from optical images of scenes back to knowledge of the objects that gave rise to them Visual Perception Modeling
Vision as a Heuristic Process • Visual system makes a lot of assumptions about the nature of the environment and conditions under which it is viewed • These assumptions constrain the inverse problem enough to make it solvable most of the time • The resulting solution will be veridical if the assumptions are true • Vision is a heuristic process in which inferences are made about the most likely environmental condition that could have produced a given image Visual Perception Modeling
Perception as Bayesian Inference • Images I are observations • Scene properties S are not known • p(S) specifies the prior knowledge about the scene • The knowledge you have without looking at the image • Bayes’ rule Visual Perception Modeling
Four Stages of Visual Processing • Image-based stage • Surface-based stage • Object-based stage • Category-based stage Visual Perception Modeling
Image-based Stages • Most theorists agree that initial stage is not the only representation based on a two-dimensional retinal organization • It includes image-processing operations • Local edge and line detection • Region detection • Correspondence between left and right eyes • Marr called this representation primal sketches • Raw primal sketch • Full primal sketch Visual Perception Modeling
Representation in Early Vision • Local spatial/frequency representation • The representation should be • Local • Orientation-tuned • Frequency-tuned • Gabor filters • Wavelet transformation • Image compression Visual Perception Modeling
Gabor Filters Visual Perception Modeling
Surface-based Stage • Recovery of intrinsic properties of visible surfaces • Surface layout • The spatial distribution of visible surfaces within the 3-D environment • Explicit surface-based representation • 2.5-D sketch • Intrinsic images • Intrinsic properties to surfaces Visual Perception Modeling
Surface-Based Stage – cont. • Surface primitives • Local patches of 2-D surface within a 3-D space • Three-dimensional geometry • Projective geometry • Viewer-centered reference frame Visual Perception Modeling
Surface-Based Stage – cont. • Cues for surface representation • Stereopsis • Motion parallax • Shading and shadows • Pictorial properties • Texture • Size • Shape • Occlusion Visual Perception Modeling
Object-Based Stage • Some form of true 3-D representation • Includes unseen and occluded surfaces • Explicit representations of whole objects • Two ways of constructing object representation • Extend the surface-based representation • Infer 3-D objects from 2-D images Visual Perception Modeling
Object-Based Stage – cont. • Volumetric primitives • Descriptions of truly 3-D volumes • Three-dimensional geometry • Geometry in 3-D space • Object-based reference frame • Spatial relations among the volumetric primitives are represented by intrinsic structures among volumetric structures Visual Perception Modeling
Category-Based Stage • Final stage concerns with recovering fully the functional properties of objects • Functional properties through categorization • Properties directly from visible characteristics Visual Perception Modeling
Top-down vs. Bottom-up Processes • Bottom-up processing • Data driven processing • Take a lower-level representation as input and create or modify a higher-level representation • Top-down processing • Expectation-driven processing • Processes that take a higher-level representation as input and produce or modify a lower-level representation Visual Perception Modeling
Neural Network Approaches • Neural networks are based on the assumptions that human vision depends heavily on the massively parallel structure of neural circuits in the brain • Multiple Layer Perceptrons • Input layer • Hidden layer • Output layer Visual Perception Modeling
Problems in Computer Vision • Given a matrix of numbers representing an image, or a sequence of images, how to generate a perceptually meaningful description of the matrix? • An image can be a color image, gray level image, or other format such as remote sensing images • A two-dimensional matrix represents a single image • A three-dimensional matrix represents a sequence of images • A video sequence is a 3-D matrix • A movie is also a 3-D matrix Visual Perception Modeling
Image Classification • Given some types through examples, identify the type of a new image Visual Perception Modeling
A texture image Image Segmentation • Partition the images into homogenous regions • Widely studied problem • A very difficult problem • An important problem Visual Perception Modeling
A cheetah image Object Recognition • Object recognition • Recognize objects in a constrained environment • Identify objects from images Visual Perception Modeling
Video Sequence Analysis • Motion analysis • Compute motion from images • Motion segmentation • Video sequence analysis • Derive models automatically • Enhanced TV Visual Perception Modeling