330 likes | 458 Views
3D Reconstruction of Objects behind Dense Occlusions. Vaibhav Vaish. Joint work with Rick Szeliski, Larry Zitnick, Sing Bing Kang, Brian Curless. SAP: Seeing through occlusions. Images: 82 Aperture: 1m. SAP: Seeing through occlusions. Images: 45 Aperture: 2m. SAP: Crowd scene.
E N D
3D Reconstruction of Objects behind Dense Occlusions Vaibhav Vaish Joint work with Rick Szeliski, Larry Zitnick, Sing Bing Kang, Brian Curless
SAP: Seeing through occlusions Images: 82 Aperture: 1m
SAP: Seeing through occlusions Images: 45 Aperture: 2m
SAP: Crowd scene Images: 60 Aperture: 90o
SAP: Crowd scene Images: 90 Aperture: 3m
Synthetic Aperture: Strengths • Can see through occluders of high density • No segmentation of foreground required • Works with noisy images, no color calibration required If we have enough cameras spanning a wide enough aperture, we can see through occlusions
3D Reconstruction • Challenges: • Find 3D location and shape of occluded objects • Construct an image with non-planar focal surface so that all occluded objects are in focus • Questions: • Does shape from focus do better than shape from stereo ? • Can we design alternatives to focus / stereo which are more robust in the presence of occlusions ?
Outline • 3D Reconstruction via plane sweep • Stereo v/s Focus • Alternative methods • Results • Future Work
z Plane Sweep: Stereo
Plane Sweep: Stereo Computing depth(x,y): for each depth z for each pixel (x,y) V(x,y,z) variance of rays through (x,y,z) (x,y) is assigned the depth z for which the variance is minimum z
Plane Sweep: Focus Computing depth(x,y): for each depth z compute SAP image focused at depth z for each pixel (x,y) F(x,y,z) gradient of SAP image at (x,y) (x,y) is assigned the depth z for which F(x,y,z) is maximum z
Depth Estimation Framework • Initial depth estimate • For each depth assignment (x, y, d), compute a cost measure • For each pixel, find the minimum cost depth • Global Optimization • Graph cuts, belief propagation • Enforce smoothness constraints • Iterative Refinement
Variance vs. focus: intensity ramp In regions of constant gradient, variance can recover correct depth, but focus is unresponsive. Focus requires non-zero 3rd order gradients to recover depth.
Range of depths Occlusions Depth of background object Foreground occluders
Median Color • Both the stereo and focus operators use the mean as an estimate of surface color • In presence of occlusions the mean may not be an accurate measure – even when the depth is right • Could we do better by using a more robust measure like median ?
Median Color Computing depth(x,y): for each depth z for each pixel (x,y) compute median color of rays through (x,y,z) M(x,y,z) sum of distances to median (x,y) is assigned the depth z which minimizes M(x,y,z)
Entropy At the correct depth, the intensity histogram should be peaked
Entropy At incorrect depths, the histogram should be more uniform Use histogram entropy H = - x log x as a cost function
Test Scene 105 images (21x5 light field) 60cm x 10cm synthetic aperture Image Resolution: 650x515
Synthetic Focus Mean color at estimated depth
Variance Mean color at estimated depth
Median Color Median color at estimated depth
Entropy Mean color at estimated depth
- = +…+ = Adding per-view occlusion maps yields occlusion map for points on the CD surface Comparison: Ground truth and Occlusion Map
Future Work • Compare with algorithms that try to delete occluding objects • Voxel Coloring [Seitz, CVPR 97] • Analyze effect of different kinds of occluders • color distribution, feature size • Aperture shape, size and sampling • Impact of number of cameras • Add global optimization
Acknowledgements • Rick, Larry, Sing Bing, Brian … • Fellow Interns (Jan, Noah, Ashish, Karteek, Ashley, Le, Manuel …) • Assistance in acquisition • Augusto Roman, David Koller, Neel Joshi