550 likes | 562 Views
Explore the challenges and techniques for extracting 3D information from 2D images to infer geometric classes using various cues and spatial support, ultimately leading to accurate surface layout estimates.
E N D
Recovering Surface Layout from a Single ImageD. Hoiem, A.A. Efros, M. HebertRobotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29, 2009
Reason 1: We may want to interact with the scene Navigation Manipulation
Computers need context too True Detection False Detections Missed Missed True Detections Local Detector: [Dalal-Triggs 2005]
Context in Image Space [Torralba Murphy Freeman 2004] [Kumar Hebert 2005] [He Zemel Cerreira-Perpiñán 2004]
We need 3d info to reason about 3d relationships Close Not Close
How to represent scene space? Holistic Scene Space: “Gist” Torralba & Oliva 2002 Oliva & Torralba 2001
How to represent scene space? Depth Map Saxena, Chung & Ng 2005, 2007
Gibson’s Surface Layout • Gibson: “The elementary impressions of a visual world are those of surface and edge.” The Perception of the Visual World (1950) • Focus on texture gradients slide from Aude Oliva
Gibson’s Surface Layout Surface Layout (Gibson cont.) slide from Aude Oliva
Gibson’s Surface Layout Surface Layout (Gibson cont.) slide from Aude Oliva
Marr’s 2½D Sketch Marr’s 2½-D Sketch Figs from Aude Oliva slide
Surface Layout (this paper) • Goal: Label image into 7 Geometric Classes: • Support • Vertical • Planar: facing Left (), Center ( ),Right () • Non-planar: Solid (X), Porous or wiry (O) • Sky
Our Main Challenge • Recovering 3D geometry from single 2D projection • Infinite number of possible solutions! …
Our World is Structured Our World Abstract World Image Credit (left): F. Cunin and M.J. Sailor, UCSD
Hansen & Riseman 1978 (VISIONS) Barrow & Tenenbaum 1978 (Intrinsic Images) Brooks 1979 (ACRONYM) Marr 1982 (2½ D Sketch) Most Early Work Tried to Manually Specify the Structure Guzman 1968 Ohta & Kanade 1978
Infer Most Likely Scene Unlikely Likely
1. Use All Available Cues Color, texture, image location Vanishing points, lines Texture gradient
2. Get Good Spatial Support 50x50 Patch 50x50 Patch
Image Segmentation • Single segmentation won’t work • Solution: multiple segmentations …
Labeling Segments … … For each segment: - Get P(good segment | data) P(label | good segment, data)
Image Labeling Labeled Segmentations … Labeled Pixels
Decision Trees + Adaboost High in Image? Gray? Yes No Yes No Smooth? Green? High in Image? Many Long Lines? … Yes Yes No Yes No Yes No No Blue? Very High Vanishing Point? Yes No Yes No Ground Vertical Sky Collins et al. 2002
Surface Confidence Maps P(Support) P(Vertical) P(Sky) P(Planar Left) P(Planar Center) P(Planar Right) Test Image P(Non-Planar Solid) P(Non-Planar Porous)
Surface Estimates: Outdoor Avg. Accuracy Main Class: 88% Subclass: 62% Input Image Ground Truth Our Result
Surface Estimates: Outdoor Ground Truth Our Result Input Image
Surface Estimates: Outdoor Ground Truth Our Result Input Image
Surface Estimates: Paintings Input Image Our Result
Surface Estimates: Indoor Avg. Accuracy Main Class: 93% Subclass: 76% Input Image Ground Truth Our Result
Failures: Reflections and Shadows Our Result Input Image
Average Accuracy Main Class: 88% Subclasses: 61%
Automatic Photo Popup Fit Ground-Vertical Boundary with Line Segments Form Segments into Polylines Cut and Fold Labeled Image Final Pop-up Model [Hoiem Efros Hebert 2005]
Surfaces Not Enough – Need Occlusion Reasoning Image Surface Labels 3D Model
Surfaces + Occlusions + Objects = Better 3D Models Surface Maps Depth, Boundaries Surfaces Occlusions Boundaries Support Horizon, Object Maps Horizon, Object Maps Viewpoint/Size Reasoning Objects and Viewpoint
Contributions • General principles • Learn the structure of the world • Use all available cues • Spatial support matters • Use redundancy to deal with unreliable processes (segmentation) • Results include entire spread of failure and success • First work to convincingly demonstrate single-view reconstruction