980 likes | 1.1k Views
My Group’s Current Research on Image Understanding. An image-understanding task. Low -level vision. Color, Shape, Texture. Low-level vision. Simple Segmentation. Color, Shape, Texture. Low-level vision. Simple Segmentation. Color, Shape, Texture. Object recognition. Low-level vision.
E N D
Color, Shape, Texture Low-level vision
Simple Segmentation Color, Shape, Texture Low-level vision
Simple Segmentation Color, Shape, Texture Object recognition Low-level vision
High-level perception Simple Segmentation Color, Shape, Texture Object recognition Low-level vision
High-level perception Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Low-level vision
High-level perception Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision
High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision
High-level perception “Meaning” ??? Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision
High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision The “SEMANTIC GAP’
High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision HMAX model of visual cortex Riesenhuber, Poggio, et al. The “SEMANTIC GAP’
High-level perception “Meaning” Active Symbol Architecture for high-level perception Hofstadter et al. Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision HMAX model of visual cortex Riesenhuber, Poggio, et al. The “SEMANTIC GAP’
High-level perception “Meaning” Active Symbol Architecture for high-level perception Hofstadter et al. Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision HMAX model of visual cortex Riesenhuber, Poggio, et al. The “SEMANTIC GAP’
The HMAX model for object recognition (Riesenhuber, Poggio, Serre, et al.)
Streetscenes “scene understanding” system(Bileschi, 2006) Recognition Phase … 1. Densely tile the image with windows of different sizes. 2. HMAX features are computed in each window. 3. The features in each window are given as input to the trained support vector machine. 4. If the SVM returns a score above a learned threshold, then the object is said to be “detected” . …
Object detection (here, “car”) with HMAX model (Bileschi, 2006)
Some limitations of the Streetscenes approach to scene understanding
Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization
Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over:
Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image
Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image • Object categories (e.g., car, pedestrian, tree, etc.)
Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image • Object categories (e.g., car, pedestrian, tree, etc.) Exhaustive use of HMAX features in each window
Does not recognize spatial and abstract relationships among objects for whole scene understanding
Does not recognize spatial and abstract relationships among objects for whole scene understanding • Has no prior knowledge about object categories and their place in “conceptual space”
Does not recognize spatial and abstract relationships among objects for whole scene understanding • Has no prior knowledge about object categories and their place in “conceptual space” • HMAX model is completely feed-forward; no feedback to allow context to aid in scene understanding.
Goal of our project • Perform whole-scene interpretation without exhaustive search. • Incorporate conceptual knowledge • Allow feedforward and feedback modes to interact
A Simple Semantic Network (or “Ontology”) “Dog walking” Person Dog leash holds attached to action action walking
But... http://www.dogasaur.com/blog/wp-content/uploads/2011/04/dogwalker.jpg
But... http://www.vet.k-state.edu/depts/development/lifelines/images/dog_jog_1435.jpg
“Dog walking” Person Dog leash holds attached to Dog Group action action running walking
“Dog walking” Person Dog leash holds attached to Dog Group action action Allowing “conceptual slippage” running walking
But... http://3.bp.blogspot.com/_1YuoCTv4oKQ/S71jUDm7kOI/AAAAAAAAAak/jz4Pg7zzzQ8/s1600/23743577.JPG
http://lh3.ggpht.com/-ZZrYWeBFTjo/SFQH_0ijwaI/AAAAAAAABjA/8nwryW2BmEw/IMG_0356.JPGhttp://lh3.ggpht.com/-ZZrYWeBFTjo/SFQH_0ijwaI/AAAAAAAABjA/8nwryW2BmEw/IMG_0356.JPG
“Dog walking” Tail holds attached to leash Dog Group Person Dog action Cat action walking running Iguana
But... http://www.mileanhour.com/post/Dog-walking-bike.aspx
http://www.k9ring.com/blog/image.axd?picture=2010%2F3%2Fwalking_dog_from_car.jpghttp://www.k9ring.com/blog/image.axd?picture=2010%2F3%2Fwalking_dog_from_car.jpg
http://www.guy-sports.com/fun_pictures/dog_walking_helicopter.jpghttp://www.guy-sports.com/fun_pictures/dog_walking_helicopter.jpg
http://static.themetapicture.com/media/funny-dog-walking-horse-leash.jpghttp://static.themetapicture.com/media/funny-dog-walking-horse-leash.jpg
http://macwetblog.files.wordpress.com/2012/05/dog-walking.jpghttp://macwetblog.files.wordpress.com/2012/05/dog-walking.jpg
“Dog walking” Tail Person Dog leash Helicopter Segue-ing Biking Driving Car holds attached to Dog Group action action Cat running Iguana walking Horse Treadmill-ing
Active Symbol Architecture(Hofstadter et al., 1995) • Basis for • Copycat (analogy-making), Hofstadter & Mitchell • Tabletop (anlaogy-making), Hofstadter & French • Metacat(analogy-making and self-awareness), Hofstadter & Marshall and many others…
Semantic network Active Symbol Architecture(Hofstadter et al., 1995) Perceptual agents (codelets) are “active symbols” Workspace Temperature
Petacat:(Descendant of Copycat, part of the PetaVisionproject)Integration of Active Symbol Architecture and HMAX Initial task: Decide if image is an instance of “taking a dog for a walk”, and if so, how good an instance it is.