Where computer vision needs help from computer science (and machine learning)

Where computer vision needs help from computer science (and machine learning) Bill Freeman Electrical Engineering and Computer Science Dept. Massachusetts Institute of Technology September 9, 2009

Outline • My background • Computer vision applications • Computer vision techniques and problems: • Low-level vision: underdetermined problems • High-level vision: combinatorial problems • Miscellaneous problems

At Photokina, in Cologne, Germany

Me (Foreign Expert) and my wife (English teacher), riding from the Foreigners’ Cafeteria at the Taiyuan University of Technology, Shanxi, China

While in China, I read this book (to be re-issued by MIT Press this year), and got very excited about computer vision. Studied for PhD at MIT.

Worked for 9 years at Mitsubishi Electric Research Labs, an industrial research lab doing fundamental research across the street from MIT.

2001 – present, MIT

Infinite images Joint work with: BilianaKaneva Josef Sivic ShaiAvidan Antonio Torralba

A computer graphics application ofbelief propagation for optimal seam finding

The image database • We have collected ~6 million images from Flickr based on keyword and group searches • typical image size is 500x375 pixels • 720GB of disk space (jpeg compressed)

Image representation GIST [Oliva and Torralba’01] Original image Color layout

Obtaining semantically coherent themes We further break-up the collection into themes of semantically coherent scenes: Train SVM-based classifiers from 1-2k training images [Oliva and Torralba, 2001]

Basic camera motions Starting from a single image, find a sequence of images to simulate a camera motion: Forward motion Camera rotation Camera pan

2. View from the virtual camera 3. Find a match to fill the missing pixels Scene matching with camera view transformations: Translation 1. Move camera 4. Locally align images 5. Find a seam 6. Blend in the gradient domain

Scene matching with camera view transformations: Camera rotation 1. Rotate camera 4. Stitched rotation 2. View from the virtual camera 3. Find a match to fill-in the missing pixels 5. Display on a cylinder

More “infinite” images – camera translation

Forward Rotate (left/right) Pan (left/right) Virtual space as an image graph • Nodes represent Images • Edges represent particular motions: Image graph • Edge cost is given by the cost of the image match under the particular transformation Kaneva, Sivic, Torralba, Avidan, and Freeman, Infinite Images, to appear in Proceedings of IEEE.

Virtual image space laid out in 3D Kaneva, Sivic, Torralba, Avidan, and Freeman, Infinite Images, to appear in Proceedings of IEEE.

Where computer vision needs help from computer science (and machine learning)

Where computer vision needs help from computer science (and machine learning)

Presentation Transcript

Computer Science in Practice

COMPUTER VISION: SOME CLASSICAL PROBLEMS

Computer SCIENCE

Computer Vision Machine Learning Features

Computer SCIENCE

Hierarchical Models of Vision: Machine Learning/Computer Vision

Computer SCIENCE

Computer SCIENCE

Computer Vision CS 776 Spring 2014

Challenges in Computer Vision

Computer SCIENCE

Computer Vision

Computer SCIENCE

Computer Science 101

CSE 185 Introduction to Computer Vision

Computer SCIENCE

Computer SCIENCE

Computer Science 210 Computer Organization

Computer Vision

Computer Vision

computer science assignment help

COMPUTER VISION: SOME CLASSICAL PROBLEMS