540 likes | 674 Views
Patch Based Synthesis for Single Depth Image Super-Resolution (ECCV 2012) Oisin Mac Aodha , Neill Campbell, Arun Nair and Gabriel J. Brostow. Presented By: Itzik Ben Shabat January 2014. Contents. Problem & Motivation SR General Overview Related Work The Proposed Method Results
E N D
Patch Based Synthesis for Single Depth Image Super-Resolution (ECCV 2012)Oisin Mac Aodha, Neill Campbell, Arun Nair and Gabriel J. Brostow Presented By: Itzik Ben Shabat January 2014
Contents • Problem & Motivation • SR General Overview • Related Work • The Proposed Method • Results • Qualitative • Quantitative • Future Work • Paper review
Problem & Motivation • How do we convert a Low Resolution (LR) image to High Resolution (HR) ? • Get a better camera (Sensor)
Problem & Motivation • Cant get a better camera? Super Resolve the image! (SR)
Problem & Motivation • Now do it in RGB-D ! ! ! PMD CamCube - 200X200 MS Kinect - 640x480 PointGreyBumbleBee 2 - 640x480 at 48fps
SR General Overview Common approaches: • Take multiple LR images from different angles and reconstruct the additional information (requires multiple images)
SR General Overview Common approaches: • Use a LR to HR database (requires a database) • Focus on this approach
Related Work • Intensity Images • EbSR [15] - Freeman, W.T., Liu, C.: Example-based super resolution. In: Advances in Markov Random Fields for Vision and Image Processing. MIT Press (2011) • Similar: • Filter input • Normalized Patch matching • Solving minimum energy problem (using BP) • Different • Not Designed for RGB-D images • Matching HR and interpolated LR patches
Related Work • EbSR looking closer EbSR Output Ground trouth
Related Work • Intensity Images • ScSr [17] - Yang, J., Wright, J., Huang, T.S., Ma, Y.: Image super-resolution via sparse representation. IEEE Transactions on Image Processing (2010) • Similar: • Use patches and minimization problem • Different: • Not designed for RGB-D images • Two dictionaries • Database structure (sparse representation) • Solves 2 minimization problems separately – global and local • No noise reduction implementation
Related Work • Depth + Intensity Hybrids • Cross Bilateral [1] - Yang, Q., Yang, R., Davis, J., Nister, D.: Spatial-depth super resolution for range images. In: CVPR. (2007) • Similar: • Specific for RGB-D • Use bilateral filter • Different: • Requires additional input (destination resolution image) • Doesn’t use patches • Solves a fusion problem
Related Work • Depth + Intensity Hybrids • MRF SR [25]- Diebel, J., Thrun, S.: An application of markov random fields to range sensing. In: NIPS. (2005) • Similar: • Uses MRF • Different: • Uses multi-resolution MRF • Requires additional input (destination resolution image) • Doesn’t use patches • Solves a fusion problem
The Proposed Method • Challenges • Construct database • Noise • “Flying pixels” at discontinuities • Wrong depths for specular or dark materials • Edges – jarring artifacts (different than rgb image)
The Proposed Method – Database • Constructing the database • Less sources for database construction than rgbimages • Considered synthetic Vs. Real datasets • Database uses 30 scenes of 800x800 (scenes flipped left to right) – 5.3 million patches • Pruning to remove redundant patches (planar surfaces) Synthetic Laser Scan
The Proposed Method - Filtering • Noise Reduction • Assumption – High frequency=noise • A. Bilateral filter on input patches before patch normalization • Edge preserving • Noise reducing • Nonlinear • Weighted average of intensity values from nearby pixels • *Used in Adobe Photoshop “Blur” function • B. Bicubic filter on database HR patches before down-sampling
The Proposed Method - Filtering After Bilateral filtering Input
The Proposed Method - Filtering • Noise Reduction • Pro – Cleaner image for patching • Con – Some data is lost
The Proposed Method – Matching Input Depth Image
The Proposed Method – Matching Input Depth Image N non overlapping low resolution input patches xi For each xi we wish to find its corresponding high resolution yi Patches are normalized
The Proposed Method – Matching High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image High Resolution Database
The Proposed Method – Matching Output Depth Image High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image … High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image … High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image … High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image … High Resolution Database
The Proposed Method – Matching Input Depth Image Output Depth Image … … High Resolution Database
The Proposed Method - Matching • Matching Patches to database • Matching is done between LR patches • Kd tree is used for speeding up the process
The Proposed Method - Reconstruction • Solving minimum energy problem • Solved using TRW-S algorithm (based on belief propagation) • Ed -Unary Potential - Difference between normalized matching LR patches
The Proposed Method - Reconstruction • Es -Pairwise Potential - Difference between un-normalized HR patch overlaps
The Proposed Method - Reconstruction • Es - Pairwise Potential - yi yj
The Proposed Method - Reconstruction • Es - Pairwise Potential - yi yj
The Proposed Method - Reconstruction • Es - Pairwise Potential - yi yj
The Proposed Method - Reconstruction • Es - Pairwise Potential - yi yj
The Proposed Method - Reconstruction )2 )2 + ( Es=( - - yi yj
The Proposed Method - Reconstruction • Normalization – is un-normalized based on the input patch min and max values
The Proposed Method – Filter Results • Noise reduction • C. Post processing Denoising – Outlier detection and correction using threshold Result after denoising Result input
Results - Qualitative • Exp 1: • Used Middleburry stereo dataset • Down-sampled the ground truth (X2,X4) • Reconstructed • Compared RMSE • Exp2: • 3 laser scans • Upsampled by 4 • Ground touth comparison • Exp3: • Use synthetic Vs. real database
Results - Qualitative • Proposed method Vs. Other Methods (Exp. 2) Proposed Method
Results - Qualitative • Proposed method - Real Vs. Synthetic training data (Exp. 3)
Results - Quantitative Method used • Reminder: MRF RS Cross Bilateral ScSREbSR RGB-D image used RGB-D image used Method used Upsampling factor Method used RMSE
Results - Qualitative Results Movie
Conclusions • 1st or 2nd best from intensity based methods • MRF RS, Cross Bilateral are better but require more data • Speed is not realtime compatible • Super resolving moving depth videos • Synthetic data exhibit better results than scanned data for training.
Future work • Extend to exploit temporal context (in video) • Exploit context when querying the database • Develop a sensor specific noise model for better results