170 likes | 288 Views
Vision-based Interaction. Scott Klemmer 17 November 2005. cs 547: Blake Ross and Asa Dotzler. Mozilla: Creating simple software in a geek-driven culture. The first vision-based interface. Myron Krueger used computer vision to create Responsive Environments (1970s) “Reaction is the Medium”
E N D
Vision-based Interaction Scott Klemmer 17 November 2005
cs547: Blake Ross and Asa Dotzler Mozilla: Creating simple software in a geek-driven culture
The first vision-based interface • Myron Krueger used computer vision to create Responsive Environments (1970s) • “Reaction is the Medium” • http://www.artmuseum.net/w2vr/timeline/videoplace_video.html
How it works • Video and background are separated in analog using chroma key techniques(think broadcast news) • The first and last points of each raster are stored in the computer, and represent the person’s outline
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene • Capturing contents of detected objects • Recognizing individual members in an object class
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene
Vision-based UIs: “Verbs” • Capturing contents of detected objects
Vision-based UIs: “Verbs” • Recognizing individual members in a class
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects
I N F R A S T R U C T U R E Background Subtraction
Image Moments (of Inertia) • 0th moment is mass(total number of pixels)
Image Moments (of Inertia) • 1st moment is center
Image Moments (of Inertia) • 2nd moment is orientation
Tools for Vision apps • Intel’s OpenCV • C API to highly optimized image processing functions (threshold, dilate, optical flow, …) • http://www.intel.com/research/mrl/research/opencv • Fast to run! Slow to develop • Great for vision folks; too low-level for app folks • Papier-Mâché • Java API (and to some extent visual UI) for vision (and other physical input) • http://guir.berkeley.edu/papier-mache • Fast to develop! Slow to run • Great for app folks; ~5 fps can sometimes be too slow
Good Vision Books • Computer Vision: A Modern Approach • David Forsyth and Jean Ponce (2003) • Fantastic book; but goal is more theoretical understanding than practical application • Robot Vision • Berthold Horn (1987) • More focused on apps and interactive algorithms • Somewhat out of date