Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29, 2009

Recovering Surface Layout from a Single Image. D. Hoiem, A.A. Efros, M. Hebert. Robotics Institute, CMU

Why worry about 3d scenes?

Reason 1: We may want to interact with the scene (navigation, manipulation)

Reason 2: We need context

2D Object Detection

What the 2D Detector Sees

Computers need context too. Figure labels: True Detections, False Detections, Missed. Local Detector: [Dalal-Triggs 2005]

Context in Image Space [Torralba Murphy Freeman 2004] [Kumar Hebert 2005] [He Zemel Carreira-Perpiñán 2004]

We need 3D info to reason about 3D relationships (figure labels: Close, Not Close)

How to represent scene space?

How to represent scene space? Holistic Scene Space: “Gist” Torralba & Oliva 2002 Oliva & Torralba 2001

How to represent scene space? Depth Map Saxena, Chung & Ng 2005, 2007

Gibson’s Surface Layout • Gibson: “The elementary impressions of a visual world are those of surface and edge.” The Perception of the Visual World (1950) • Focus on texture gradients slide from Aude Oliva

Gibson’s Surface Layout Surface Layout (Gibson cont.) slide from Aude Oliva

Marr’s 2½-D Sketch (figures from Aude Oliva slide)

Surface Layout (this paper) • Goal: Label image into 7 Geometric Classes: • Support • Vertical • Planar: facing Left, Center, Right • Non-planar: Solid (X), Porous or wiry (O) • Sky

Our Main Challenge • Recovering 3D geometry from single 2D projection • Infinite number of possible solutions! …

Our World is Structured. Panels: Our World; Abstract World. Image Credit (left): F. Cunin and M.J. Sailor, UCSD

Most Early Work Tried to Manually Specify the Structure: Guzman 1968; Ohta & Kanade 1978; Hansen & Riseman 1978 (VISIONS); Barrow & Tenenbaum 1978 (Intrinsic Images); Brooks 1979 (ACRONYM); Marr 1982 (2½-D Sketch)

Learn the Structure of the World …

Infer Most Likely Scene (figure labels: Unlikely, Likely)

1. Use All Available Cues: color, texture, image location; vanishing points, lines; texture gradient

Use All Available Cues

2. Get Good Spatial Support 50x50 Patch 50x50 Patch

Image Segmentation • Single segmentation won’t work • Solution: multiple segmentations …

Labeling Segments … … For each segment, get two estimates: P(good segment | data) and P(label | good segment, data)

Image Labeling Labeled Segmentations … Labeled Pixels
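The step from labeled segmentations to labeled pixels can be sketched as follows. This is a minimal illustration, assuming each pixel's label confidence is the sum, over every segment that contains it, of P(good segment | data) times P(label | good segment, data), then normalized. The function name, the dictionary keys, and the flat pixel indexing are hypothetical choices made for this sketch, not taken from the talk.

```python
def pixel_label_confidences(segmentations, num_pixels, num_labels):
    """Combine several labeled segmentations into per-pixel label confidences.

    segmentations: list of segmentations; each segmentation is a list of
    segments; each segment is a dict with (hypothetical) keys
      'pixels'  : list of pixel indices covered by the segment
      'p_good'  : P(good segment | data)
      'p_label' : list of P(label | good segment, data), one per label
    """
    conf = [[0.0] * num_labels for _ in range(num_pixels)]
    for segmentation in segmentations:
        for seg in segmentation:
            for px in seg["pixels"]:
                for k in range(num_labels):
                    # Weight each segment's label estimate by how much
                    # we trust the segment itself.
                    conf[px][k] += seg["p_good"] * seg["p_label"][k]
    # Normalize so each pixel's confidences sum to 1.
    for px in range(num_pixels):
        total = sum(conf[px])
        if total > 0:
            conf[px] = [c / total for c in conf[px]]
    return conf


def most_likely_labels(conf):
    """Pick the highest-confidence label index for every pixel."""
    return [max(range(len(row)), key=row.__getitem__) for row in conf]
```

A pixel covered by one unreliable coarse segment and one reliable fine segment ends up dominated by the reliable one, which is the point of using multiple segmentations rather than trusting a single one.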

Decision Trees + Adaboost. Figure: example decision trees whose nodes ask yes/no questions (High in Image? Gray? Smooth? Green? Many Long Lines? Blue? Very High Vanishing Point?) and whose outputs are Ground, Vertical, or Sky. Collins et al. 2002
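To make the boosting idea concrete, here is a small sketch of discrete AdaBoost for a two-class problem (for example, sky versus not sky). It is a simplification and not the classifier used in the talk: the weak learners are one-level decision stumps rather than deeper trees, and the two features in the usage note (height in image, blueness) are hypothetical stand-ins for the real cue set.

```python
import math


def fit_stump(X, y, w):
    """Return (weighted_error, feature, threshold, sign) of the best stump.

    A stump predicts `sign` when row[feature] > threshold, else `-sign`.
    """
    best = None
    for f in range(len(X[0])):
        for thr in sorted(set(row[f] for row in X)):
            for sign in (1, -1):
                err = sum(
                    wi
                    for wi, row, yi in zip(w, X, y)
                    if (sign if row[f] > thr else -sign) != yi
                )
                if best is None or err < best[0]:
                    best = (err, f, thr, sign)
    return best


def stump_predict(stump, row):
    _, f, thr, sign = stump
    return sign if row[f] > thr else -sign


def adaboost(X, y, rounds=3):
    """Train a weighted vote of stumps; labels in y must be +1 or -1."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        stump = fit_stump(X, y, w)
        # Clamp the error so the log below stays finite.
        err = min(max(stump[0], 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        # Increase the weight of misclassified examples, decrease the rest.
        w = [
            wi * math.exp(-alpha * yi * stump_predict(stump, row))
            for wi, row, yi in zip(w, X, y)
        ]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def predict(ensemble, row):
    score = sum(alpha * stump_predict(s, row) for alpha, s in ensemble)
    return 1 if score > 0 else -1
```

As a usage example, training on rows of the form [height_in_image, blueness] with +1 meaning sky lets the ensemble learn that a region must be both high and blue, something no single threshold test can express on its own.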

Surface Confidence Maps. Panels: Test Image, P(Support), P(Vertical), P(Sky), P(Planar Left), P(Planar Center), P(Planar Right), P(Non-Planar Solid), P(Non-Planar Porous)

Experiments: Input Image

Experiments: Ground Truth

Experiments: Our Result

Surface Estimates: Outdoor. Avg. Accuracy: Main Class 88%, Subclass 62%. Panels: Input Image, Ground Truth, Our Result

Surface Estimates: Outdoor Ground Truth Our Result Input Image

Surface Estimates: Paintings Input Image Our Result

Surface Estimates: Indoor. Avg. Accuracy: Main Class 93%, Subclass 76%. Panels: Input Image, Ground Truth, Our Result

Failures: Reflections and Shadows Our Result Input Image

Average Accuracy Main Class: 88% Subclasses: 61%

Importance of Many Cues

Spatial Support Matters

Automatic Photo Popup. Steps: Labeled Image; Fit Ground-Vertical Boundary with Line Segments; Form Segments into Polylines; Cut and Fold; Final Pop-up Model [Hoiem Efros Hebert 2005]

video

Surfaces Not Enough – Need Occlusion Reasoning Image Surface Labels 3D Model

Surfaces + Occlusions + Objects = Better 3D Models. Diagram labels: Surfaces; Occlusions; Objects and Viewpoint; Viewpoint/Size Reasoning; Surface Maps; Depth, Boundaries; Boundaries, Support; Horizon, Object Maps

video 2

Contributions • General principles • Learn the structure of the world • Use all available cues • Spatial support matters • Use redundancy to deal with unreliable processes (segmentation) • Results include entire spread of failure and success • First work to convincingly demonstrate single-view reconstruction

