Seminar on Media Technology. Computer Vision. Albert Alemany Font. Outlines. Introduction What is computer vision and why this topic History of computer vision and related disciplines Applications Face/smile detection, OCR, object recognition, medical imaging, ... Conclusions References.
Seminar on Media Technology Computer Vision • Albert Alemany Font
Outline • Introduction • What is computer vision and why this topic • History of computer vision and related disciplines • Applications • Face/smile detection, OCR, object recognition, medical imaging, ... • Conclusions • References
What is computer vision? Given one or more images, extract properties of the 3D world. Example: a traffic scene • Number of vehicles • Type of vehicles • Location of the closest obstacle • Assessment of congestion • Location where the scene was captured • ...
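As a toy illustration of extracting one such property ("number of vehicles"), the sketch below counts connected groups of foreground pixels in a binary mask. It assumes a segmentation step (not covered in these slides) has already marked vehicle pixels with 1; the function name `count_blobs` and the sample mask are hypothetical.

```python
from collections import deque


def count_blobs(mask):
    """Count 4-connected groups of 1-pixels in a binary mask (list of rows)."""
    if not mask or not mask[0]:
        return 0
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    blobs = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 1 or seen[r][c]:
                continue
            # New unvisited foreground pixel: flood-fill its whole blob.
            blobs += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return blobs


# Hypothetical foreground mask of a road seen from above: 1 = vehicle pixel.
scene = [
    [1, 1, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 1, 1, 0, 0, 0],
]
vehicle_count = count_blobs(scene)
```

A real traffic system would need much more than this (segmentation, tracking, occlusion handling); the point is only that a scene property is computed from pixel data.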
History of computer vision • 1950s – Two-dimensional imaging for statistical pattern recognition developed • 1960s – Roberts begins studying 3D machine vision • 1970s – MIT's Artificial Intelligence Lab opens a "Computer Vision" course • 1980s – New theories and concepts emerge. Shift toward geometry and increased mathematical rigor • 1990s – Face recognition. Statistical analysis in vogue • 2000s – Broader recognition. Large annotated datasets available. Video processing starts
Finding people in images "Yes" instances
Finding people in images "No" instances
Face detection Face detection in digital cameras • The camera detects faces in a scene, then automatically focuses (AF), optimizes exposure (AE) and, if needed, adjusts flash output
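The step after detection can be sketched roughly as follows: pick the main face and meter exposure on it. This is a minimal sketch, not any camera vendor's algorithm. It assumes a face detector (not shown) returns boxes as `(x, y, w, h)`, that the image is a grid of 0-255 brightness values, and that the `target` brightness is an arbitrary illustrative value; all function names are hypothetical.

```python
def pick_main_face(faces):
    """Return the largest face box (x, y, w, h), or None if no face was found."""
    return max(faces, key=lambda box: box[2] * box[3], default=None)


def mean_luma(image, box):
    """Average brightness of the pixels inside a box."""
    x, y, w, h = box
    pixels = [image[r][c] for r in range(y, y + h) for c in range(x, x + w)]
    return sum(pixels) / len(pixels)


def exposure_gain(image, faces, target=118.0):
    """Gain that would bring the main face to the target brightness.

    Returns 1.0 (leave exposure unchanged) when no face is detected.
    The default target is an assumed value, not a standard.
    """
    face = pick_main_face(faces)
    if face is None:
        return 1.0
    level = mean_luma(image, face)
    return target / level if level > 0 else 1.0


# Hypothetical 4x4 frame: bright background, dark face in the lower-left corner.
image = [
    [200, 200, 200, 200],
    [200, 200, 200, 200],
    [50, 50, 200, 200],
    [50, 50, 200, 200],
]
faces = [(0, 2, 2, 2)]
gain = exposure_gain(image, faces, target=100.0)
```

Metering on the face rather than the whole frame is what keeps a backlit subject from coming out too dark.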
Optical character recognition (OCR) Technology to convert scanned documents to text
Vision-based biometrics • How the Afghan girl was identified by her iris pattern: http://www.cl.cam.ac.uk/~jgd1000/afghan.html • 1984: right eye, processed image • 2002: right eye, processed image • Photographer: Steve McCurry
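The slides do not spell out the matching method. Iris identification of this kind is commonly done by encoding each iris as a binary code and comparing two codes with a fractional Hamming distance, so the sketch below assumes that approach. The short bit lists are made-up stand-ins for real iris codes, and the masks mark bits that are usable (for example, not hidden by eyelids).

```python
def fractional_hamming(code_a, code_b, mask_a=None, mask_b=None):
    """Fraction of disagreeing bits among the bits valid in both codes."""
    n = len(code_a)
    if len(code_b) != n:
        raise ValueError("codes must have the same length")
    mask_a = mask_a if mask_a is not None else [1] * n
    mask_b = mask_b if mask_b is not None else [1] * n
    # A bit is compared only if both images consider it usable.
    valid = [ma & mb for ma, mb in zip(mask_a, mask_b)]
    usable = sum(valid)
    if usable == 0:
        raise ValueError("no comparable bits")
    disagreements = sum((a ^ b) & v for a, b, v in zip(code_a, code_b, valid))
    return disagreements / usable


# Made-up 8-bit codes for illustration only.
eye_1984 = [1, 0, 1, 1, 0, 0, 1, 0]
eye_2002 = [1, 0, 0, 1, 0, 1, 1, 0]
distance = fractional_hamming(eye_1984, eye_2002)

# Same comparison, but bit 5 of the second code is marked unusable.
masked = fractional_hamming(eye_1984, eye_2002,
                            mask_b=[1, 1, 1, 1, 1, 0, 1, 1])
```

A distance near 0 suggests the same eye, while unrelated eyes tend toward 0.5; the decision threshold is a design choice not covered here.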
Object recognition • Google Goggles • Lincoln (Microsoft Research) (Figure labels: query image, matching image, webpage)
Vision evolution Google reCAPTCHA
Making the invisible visible Raw version Eulerian Video Magnification for Revealing Subtle Changes in the World SIGGRAPH 2012 http://people.csail.mit.edu/mrub/vidmag/
Making the invisible visible Magnified version Eulerian Video Magnification for Revealing Subtle Changes in the World SIGGRAPH 2012 http://people.csail.mit.edu/mrub/vidmag/
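Roughly, the idea behind this kind of magnification is to treat each pixel's brightness as a signal over time, isolate a band of temporal frequencies, amplify it, and add it back. The sketch below shows that for a single pixel only. It is a simplified illustration under stated assumptions, not the authors' implementation: the band-pass is the difference of two exponential moving averages, and the parameter values and names (`ema`, `magnify`, `alpha`, `fast`, `slow`) are hypothetical.

```python
import math


def ema(signal, rate):
    """Exponential moving average: a simple temporal low-pass filter."""
    out, state = [], signal[0]
    for value in signal:
        state += rate * (value - state)
        out.append(state)
    return out


def magnify(signal, alpha=10.0, fast=0.5, slow=0.05):
    """Amplify the temporal band between two low-pass filters and add it back."""
    band = [f - s for f, s in zip(ema(signal, fast), ema(signal, slow))]
    return [v + alpha * b for v, b in zip(signal, band)]


# Hypothetical pixel: brightness 100 with a faint 1 Hz flicker, sampled at 30 fps.
pixel = [100 + 0.2 * math.sin(2 * math.pi * t / 30) for t in range(120)]
magnified = magnify(pixel)

swing_before = max(pixel) - min(pixel)
swing_after = max(magnified) - min(magnified)
```

The average brightness is left alone while the faint flicker becomes several times larger, which is the raw versus magnified contrast shown on these two slides.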
Smart cars www.mobileye.com
Medical imaging 3D Imaging Image guided surgery
Special effects: shape capture The Matrix movies, ESC Entertainment
Special effects: motion capture Pirates of the Caribbean, Industrial Light and Magic
Video-based interaction: gaming Microsoft Natal, Sony EyeToy
Image mosaic • 3D from multiple images • 3D from one image • "Big" image from other images/video
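The "big image from other images" idea can be sketched with a toy stitcher. It assumes two grayscale strips of equal height that differ only by a horizontal shift, and finds the overlap width with the smallest mean squared difference. Real mosaicing typically needs feature matching and a full geometric alignment; the function names and sample data here are hypothetical.

```python
def best_overlap(left, right, min_overlap=2):
    """Overlap width (in columns) that minimizes the mean squared difference."""
    width = len(left[0])
    best, best_err = None, float("inf")
    for overlap in range(min_overlap, min(width, len(right[0])) + 1):
        err, count = 0.0, 0
        for row_l, row_r in zip(left, right):
            for k in range(overlap):
                # Compare the right edge of `left` with the left edge of `right`.
                d = row_l[width - overlap + k] - row_r[k]
                err += d * d
                count += 1
        err /= count
        if err < best_err:
            best, best_err = overlap, err
    return best


def stitch(left, right, min_overlap=2):
    """Join two strips, dropping the duplicated overlap columns from `right`."""
    overlap = best_overlap(left, right, min_overlap)
    return [row_l + row_r[overlap:] for row_l, row_r in zip(left, right)]


# Hypothetical 2x8 scene photographed as two overlapping 2x5 strips.
panorama = [
    [10, 20, 30, 40, 50, 60, 70, 80],
    [5, 15, 35, 25, 45, 75, 55, 65],
]
left = [row[:5] for row in panorama]
right = [row[3:] for row in panorama]
mosaic = stitch(left, right)
```

Here the two strips share two columns, so the stitched result reproduces the original eight-column scene.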
References • Richard Szeliski (2010). Computer Vision: Algorithms and Applications. Springer-Verlag. • Gérard Medioni and Sing Bing Kang (2004). Emerging Topics in Computer Vision. Prentice Hall. • Pedram Azad, Tilo Gockel, Rüdiger Dillmann (2008). Computer Vision – Principles and Practice. Elektor International Media BV. • http://people.csail.mit.edu/mrub/vidmag/ • http://www.cvpapers.com/