Accurate, Dense and Robust Multi-View Stereopsis

Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky

Agenda • Problem Statement • Multi-view Stereo Taxonomy • Algorithm • Results • Comparison to other works • Questions

Problem Statement • Multi-view Stereo • Dense shape reconstruction from multiple views = + + +

Multi-View Stereo Taxonomy S. M. Seitz, B. Curless, J. Diebel, D. Scharstein, and R. Szeliski • Scene Representation • Photoconsistency Measure • Visibility Model • Shape Prior • Reconstruction algorithm • Initialization

Scene Representation • Geometry on 3D grid • Voxels, Level sets • Polygon Mesh • Set of planar facets • Depth Map • Image that stores depth per pixel

Photoconsistency Measure • Definition: Measures visual compatibility of reconstruction with input images • Scene Space • Project part of reconstruction into images, measure closeness • Measures: Variance , sum of squared distances, normalized cross-correlation • Image Space • Use scene geometry to transform image to different view, measure error of predicted vs. actual (prediction error)

Visibility Model • Definition: Views to consider when evaluating photo consistency • Geometric • Explicitly model geometry of the scene • Quasi-Geometric • Approximate geometric reasoning • Outlier based approaches • Treat occlusions as outliers

Shape Prior • Definition: Additional constraints or assumptions about reconstruction • Minimal Surfaces • Level sets, Min-cut • Maximal Surfaces • Voxel coloring, space carving • Local Measures • Assume local smoothness on nearby pixels

Reconstruction Algorithm • Optimize cost function • Voxels, graph cut, level sets, meshes • A set of consistent depth maps • Feature extraction, matching, surface fitting

Initialization • Definition: Constraints on scene geometry • Bounding box or volume • Visual hull • Range of disparity

Overview of Algorithm input imagedetected reconstructed final patches polygonal surface features patches after after expansion from reconstructed the initial and filtering patches matching

Algorithm Block Diagram Initialization Expansion Filter Reconstruction Patch Model Feature Detection

Init • Detect features using Harris Corner and DoG • Feature matching to generate sparse set of patches

Patch Models • R(p): Most closely associated image with p • S(p): Images where p should be visible • T(p): Images where p is truly visible

c(p): from triangulation n(p): Direction of optical ray from c(p) to O β pixels Epipolar line

Normalized Cross Correlation (NCC) where is the mean of the feature and is the mean of f(x,y) in the region under the feature. Optimization step: Maximizing the average NCC score

Patch Expansion • Expand patches along tangential planes into empty areas. • Optimize for normal and center and add if photometric constraints are satisfied in at least k images.

Filtering • Analyzing visibility consistency

Filtering (Contd.) • Local smoothness constraint : Remove patches for which proportion of neighboring patches with tangential plane “nearly” parallel is less than ε

Polygonal Surface Reconstruction • Initialize using convex hull of patches • Iteratively deform/snap to the patch model using two kinds of forces • Smoothness term • Photometric Consistency term S : Current surface S* : True surface n(v) : Normal at v Π(v) : Set of patches compatible with v d(v) : Distance between S and S*

Algorithm Taxonomy Categorization • Scene Representation • Depth Map + Mesh • Photoconsistency Measure • NCC • Shape Prior • Assume local smoothness • Reconstruction • Feature extraction , depth maps, optimization over patches • Initialization • None

Results Patch Model Polygonal Surface Model

Results (Contd.)

Results (Contd.) • Evaluation on vision.middlebury.edu Accuracy Measure: Distance d in mm which brings 90% of the reconstruction within ground truth * Old Results

Results (Contd.) • Handle occlusions/obstacles

Similar Approaches • Setup similar to Goesele et al. (ICCV’07) – initialize patches, expand and optimize for position and normal

Questions • Pose the problem as an optimization problem simultaneously accounting for local smoothness, photo consistency, occlusion • Convergence of Expand/Filter – do more iterations lead to better reconstructions? • Occlusion/Outlier handling – results on more datasets • Advantages of patch model – Adaptive Resolution, generalizes to large number of object classes

Accurate, Dense and Robust Multi-View Stereopsis

Accurate, Dense and Robust Multi-View Stereopsis

Presentation Transcript

Stereopsis

Multi-View Geometry

Robust Scheduling: A General View

Towards Accurate and Robust Map-aid Indoor Human Tracking

Multi-View Sketching

What is Stereopsis ?

Multi-view stereo

Accurate Multi-View Reconstruction Using Robust Binocular Stereo and Surface Meshing

Stereopsis

Accurate Camera Calibration from Multi-view Stereo and Bundle Adjustment

Multi-view geometry

Multi-View Sketching

Binocular disparity and Stereopsis

Multi-view Drawing

Multi-View Sketching

Stereopsis

Multi-View Sketching

Stereopsis

Binocular disparity and Stereopsis

Multi-View Sketching

View Accurate and Speedy Live Sports Scores