My Group’s Current Research on Image Understanding

My Group’s Current Research on Image Understanding

An image-understanding task

Low-level vision

Color, Shape, Texture Low-level vision

Simple Segmentation Color, Shape, Texture Low-level vision

Simple Segmentation Color, Shape, Texture Object recognition Low-level vision

High-level perception Simple Segmentation Color, Shape, Texture Object recognition Low-level vision

High-level perception Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Low-level vision

High-level perception Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision

High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision

High-level perception “Meaning” ??? Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision

High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision The “SEMANTIC GAP’

High-level perception “Meaning” Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision HMAX model of visual cortex Riesenhuber, Poggio, et al. The “SEMANTIC GAP’

High-level perception “Meaning” Active Symbol Architecture for high-level perception Hofstadter et al. Simple Segmentation Color, Shape, Texture Object recognition Pattern recognition Analogy-making Low-level vision HMAX model of visual cortex Riesenhuber, Poggio, et al. The “SEMANTIC GAP’

The HMAX model for object recognition (Riesenhuber, Poggio, Serre, et al.)

Streetscenes “scene understanding” system(Bileschi, 2006) Recognition Phase … 1. Densely tile the image with windows of different sizes. 2. HMAX features are computed in each window. 3. The features in each window are given as input to the trained support vector machine. 4. If the SVM returns a score above a learned threshold, then the object is said to be “detected” . …

Object detection (here, “car”) with HMAX model (Bileschi, 2006)

Some limitations of the Streetscenes approach to scene understanding

Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization

Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over:

Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image

Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image • Object categories (e.g., car, pedestrian, tree, etc.)

Some limitations of the Streetscenes approach to scene understanding • Requires exhaustive search for object identification and localization Exhaustive search over: • Window size and location in the image • Object categories (e.g., car, pedestrian, tree, etc.) Exhaustive use of HMAX features in each window

Does not recognize spatial and abstract relationships among objects for whole scene understanding

Does not recognize spatial and abstract relationships among objects for whole scene understanding • Has no prior knowledge about object categories and their place in “conceptual space”

Does not recognize spatial and abstract relationships among objects for whole scene understanding • Has no prior knowledge about object categories and their place in “conceptual space” • HMAX model is completely feed-forward; no feedback to allow context to aid in scene understanding.

Goal of our project • Perform whole-scene interpretation without exhaustive search. • Incorporate conceptual knowledge • Allow feedforward and feedback modes to interact

A Simple Semantic Network (or “Ontology”) “Dog walking” Person Dog leash holds attached to action action walking

But... http://www.dogasaur.com/blog/wp-content/uploads/2011/04/dogwalker.jpg

But... http://www.vet.k-state.edu/depts/development/lifelines/images/dog_jog_1435.jpg

“Dog walking” Person Dog leash holds attached to Dog Group action action running walking

“Dog walking” Person Dog leash holds attached to Dog Group action action Allowing “conceptual slippage” running walking

But... http://3.bp.blogspot.com/_1YuoCTv4oKQ/S71jUDm7kOI/AAAAAAAAAak/jz4Pg7zzzQ8/s1600/23743577.JPG

http://lh3.ggpht.com/-ZZrYWeBFTjo/SFQH_0ijwaI/AAAAAAAABjA/8nwryW2BmEw/IMG_0356.JPGhttp://lh3.ggpht.com/-ZZrYWeBFTjo/SFQH_0ijwaI/AAAAAAAABjA/8nwryW2BmEw/IMG_0356.JPG

“Dog walking” Tail holds attached to leash Dog Group Person Dog action Cat action walking running Iguana

But... http://www.mileanhour.com/post/Dog-walking-bike.aspx

http://cl.jroo.me/z3/Z/e/C/d/a.aaa-Thus-walking-dog.png

ttp://thedaemon.com/images/DARPA_Segue_Dog.jpg

http://www.bikeforest.com/product45422.jpg

http://www.k9ring.com/blog/image.axd?picture=2010%2F3%2Fwalking_dog_from_car.jpghttp://www.k9ring.com/blog/image.axd?picture=2010%2F3%2Fwalking_dog_from_car.jpg

http://www.guy-sports.com/fun_pictures/dog_walking_helicopter.jpghttp://www.guy-sports.com/fun_pictures/dog_walking_helicopter.jpg

http://static.themetapicture.com/media/funny-dog-walking-horse-leash.jpghttp://static.themetapicture.com/media/funny-dog-walking-horse-leash.jpg

http://macwetblog.files.wordpress.com/2012/05/dog-walking.jpghttp://macwetblog.files.wordpress.com/2012/05/dog-walking.jpg

“Dog walking” Tail Person Dog leash Helicopter Segue-ing Biking Driving Car holds attached to Dog Group action action Cat running Iguana walking Horse Treadmill-ing

Active Symbol Architecture(Hofstadter et al., 1995)

Active Symbol Architecture(Hofstadter et al., 1995) • Basis for • Copycat (analogy-making), Hofstadter & Mitchell • Tabletop (anlaogy-making), Hofstadter & French • Metacat(analogy-making and self-awareness), Hofstadter & Marshall and many others…

Semantic network Active Symbol Architecture(Hofstadter et al., 1995) Perceptual agents (codelets) are “active symbols” Workspace Temperature

Petacat:(Descendant of Copycat, part of the PetaVisionproject)Integration of Active Symbol Architecture and HMAX Initial task: Decide if image is an instance of “taking a dog for a walk”, and if so, how good an instance it is.

My Group’s Current Research on Image Understanding

My Group’s Current Research on Image Understanding

Presentation Transcript

PPT Presentation

talk-ppt - PowerPoint Presentation

Image Processing Research Group

PowerPoint Presentation - NMSU: Research

My first ppt on this site

My PowerPoint

Image Understanding

My PowerPoint on Volcanoes'

Image Understanding

My Powerpoint

My Parts of Speech PowerPoint Presentation

My First Powerpoint Presentation

POWERPOINT PRESENTATION ON TRANSDUCERS

Group 6 Research Presentation

My research on:

Welcome To My Group Presentation

powerpoint presentation on english

My First Presentation in PowerPoint 2007

My Powerpoint

Animal Research Powerpoint Presentation

My published PowerPoint presentation

My PowerPoint Presentation