Robust Higher Order Potentials For Enforcing Label Consistency

Pushmeet Kohli (Microsoft Research Cambridge), Lubor Ladicky and Philip Torr (Oxford Brookes University, Oxford). CVPR 2008

Image Labelling Problems: Assign a label to each image pixel • Object Segmentation • Image Denoising • Geometry Estimation (Figure labels: Sky, Building, Tree, Grass)

Object Segmentation using CRFs (Shotton et al. ECCV 2006) CRF Energy: • Unary potentials based on Colour, Location and Texture features • Pairwise potentials encourage label consistency in adjacent pixels
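The energy shown on this slide did not survive in the transcript. A pairwise CRF energy of the kind described is usually written as below; the symbols (V for the set of pixels, E for the set of adjacent pixel pairs, ψ for the potentials) are notation assumed here, not taken from the slide.

```latex
E(\mathbf{x}) = \sum_{i \in \mathcal{V}} \psi_i(x_i) + \sum_{(i,j) \in \mathcal{E}} \psi_{ij}(x_i, x_j)
```

The first sum collects the unary potentials and the second the terms that encourage adjacent pixels to share a label.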

Limitations of Pairwise CRFs • Encourages short boundaries (Shrinkage bias) • Can only enforce label consistency in adjacent pixels • Inability to incorporate region based features (Figure panels: Image, Unary Potential, MAP-CRF Solution)

Label Consistency in Image Regions • Pixels constituting some regions belong to the: • Same plane (Orientation) (Hoiem, Efros, & Hebert, ICCV'05) • Same object (Russell, Efros, Sivic, Freeman, & Zisserman, CVPR06) (Figure panels: Image (MSRC), Segmentation (Mean shift))

Image labelling using segments: Image → Unsupervised Segmentation → Object Labelling • Geometric Context [Hoiem et al. ICCV05] • Object Segmentation [He et al. ECCV06, Yang et al. CVPR07, Rabinovich et al. ICCV07, Batra et al. CVPR08] • Interactive Video Segmentation [Wang, SIGGRAPH 2005] Not robust to Inconsistent Segments!

Our Higher Order CRF Model • Encourages label consistency in regions • Uses Multiple Segmentations (Figure: segment c)

Label Consistency in Segments • Encourages consistency within super-pixels • Takes the form of a P^N Potts model [Kohli et al. CVPR 2007] • Cost: 0 when all pixels in segment c take the same label • Cost: f(|c|) otherwise • Does not distinguish between Good/Bad Segments!
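The two cost cases on this slide can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function names are hypothetical, and `size_penalty` is a made-up stand-in because the slide does not define f.

```python
def pn_potts_cost(labels, f):
    """Cost of one segment: 0 if all pixels agree, f(|c|) otherwise."""
    if len(set(labels)) <= 1:
        return 0.0
    return f(len(labels))


def size_penalty(n):
    """Hypothetical size-dependent penalty; the slide leaves f unspecified."""
    return 0.5 * n


consistent = ["grass"] * 6
one_outlier = ["grass"] * 5 + ["sheep"]

print(pn_potts_cost(consistent, size_penalty))   # 0.0
print(pn_potts_cost(one_outlier, size_penalty))  # 3.0
```

A single disagreeing pixel already triggers the full penalty, which is the rigidity the later slides address.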

Quality based Label Consistency: Label inconsistency cost depends on segment quality • How to measure quality G(c)? [Ren and Malik ICCV03, Rabinovich et al. ICCV07, many others] • Colour and Texture Similarity • Contour Energy • Measure quality from variance in feature responses • Higher order generalization of contrast-sensitive pairwise potential
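One way to turn "variance in feature responses" into a quality score is sketched below. The exponential mapping and the `beta` parameter are assumptions, chosen by analogy with contrast-sensitive pairwise terms; the slide does not give the formula for G(c). In this sketch a higher score means a more homogeneous segment.

```python
import math


def segment_quality(features, beta=1.0):
    """Map the variance of per-pixel feature responses to a score in (0, 1].

    `features` is a non-empty list of scalar responses for the pixels of one
    segment. The form exp(-beta * variance) is an assumed choice.
    """
    n = len(features)
    mean = sum(features) / n
    variance = sum((f - mean) ** 2 for f in features) / n
    return math.exp(-beta * variance)


uniform = [0.5, 0.5, 0.5, 0.5]   # homogeneous segment
mixed = [0.0, 1.0, 0.0, 1.0]     # segment straddling two regions

print(segment_quality(uniform))  # 1.0
print(segment_quality(mixed) < segment_quality(uniform))  # True
```

The inconsistency cost of a segment could then be scaled by this score, so that breaking a low-quality segment is cheap.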

Quality based Label Consistency (Figure panels: MSRC image, Mean shift segmentation, Segment Quality (darker is better))

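Robust Consistency Potentials (Plots of cost against number of Inconsistent Pixels. P^N Potts: cost jumps from 0 to γmax at the first inconsistent pixel. Too Rigid! Robust: cost rises linearly from 0 and is truncated at γmax once T pixels are inconsistent.)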

Robust Consistency Potentials (Plot annotations: Maximum Inconsistency Cost γmax; Number of Inconsistent Pixels; Slope; truncation point T)
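The robust potential plotted on this slide can be sketched as a truncated linear function. The function and parameter names are hypothetical, and counting inconsistent pixels as those that disagree with the segment's dominant label is an assumption about how the horizontal axis is defined.

```python
def robust_consistency_cost(labels, gamma_max, truncation):
    """Linear in the number of inconsistent pixels, truncated at gamma_max.

    `labels` is a non-empty list of pixel labels for one segment and
    `truncation` is the count T at which the cost saturates.
    """
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    # Pixels that disagree with the dominant label of the segment.
    n_inconsistent = len(labels) - max(counts.values())
    slope = gamma_max / truncation
    return min(gamma_max, slope * n_inconsistent)


segment = ["grass"] * 9 + ["sheep"]
print(robust_consistency_cost(segment, gamma_max=4.0, truncation=4))  # 1.0

split = ["grass"] * 5 + ["sheep"] * 5
print(robust_consistency_cost(split, gamma_max=4.0, truncation=4))    # 4.0
```

Unlike the rigid model, one stray pixel costs only a fraction of γmax, so a mostly consistent segment still pulls its pixels towards the dominant label.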

Minimizing Higher order Energy Functions • Message passing is computationally expensive • High runtime and space complexity: O(L^N) • L = Number of Labels, N = Size of Clique • Efficient BP for Higher Order MRFs [Lan et al. ECCV 06, Potetz CVPR 2007] • 2x2 clique potentials for Image Denoising • Takes minutes per iteration (hours to converge)

Minimizing Higher order Energy Functions • Graph Cut based move making algorithm [Kohli et al. CVPR 2007] • Can handle very high order energy functions • Extremely efficient: computation time on the order of seconds • Only applicable to some classes of functions (P^N Potts) • Cannot handle robust consistency potential • This paper: • Can minimize a much larger class of higher order energy functions • Same time complexity as [Kohli et al. CVPR 2007]

Move making algorithms: Expansion and Swap move algorithms [Boykov, Veksler and Zabih, PAMI 2001] • Make a series of changes to the solution (moves) • Each move results in a solution with smaller energy (Diagram: Current Solution → Generate pseudo-boolean move function → Minimize move function to get optimal move → Move to new solution) How to minimize move functions?
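The loop on this slide can be illustrated with a toy expansion-move solver. This is a sketch under strong assumptions: the image is a 1-D chain with a Potts smoothness term, and the binary move function is minimised by exhaustive search rather than by graph cuts, so it is only usable on a handful of pixels. All names are hypothetical.

```python
from itertools import product


def energy(labels, unary, smooth):
    """Unary costs plus a Potts penalty between neighbours on a 1-D chain."""
    cost = sum(unary[i][l] for i, l in enumerate(labels))
    cost += sum(smooth for a, b in zip(labels, labels[1:]) if a != b)
    return cost


def best_expansion_move(labels, alpha, unary, smooth):
    """Minimise the binary move function by exhaustive search (toy sizes only)."""
    best = list(labels)
    for t in product([0, 1], repeat=len(labels)):
        # t_i = 1 means pixel i switches to alpha, t_i = 0 keeps its label.
        candidate = [alpha if ti else l for ti, l in zip(t, labels)]
        if energy(candidate, unary, smooth) < energy(best, unary, smooth):
            best = candidate
    return best


def expansion_moves(labels, n_labels, unary, smooth):
    """Cycle over labels until no expansion move lowers the energy."""
    labels = list(labels)
    improved = True
    while improved:
        improved = False
        for alpha in range(n_labels):
            new = best_expansion_move(labels, alpha, unary, smooth)
            if energy(new, unary, smooth) < energy(labels, unary, smooth):
                labels, improved = new, True
    return labels


# Made-up unary costs for 4 pixels and 3 labels.
unary = [[0, 2, 2], [1.5, 1, 2], [2, 0, 2], [2, 2, 0]]
start = [0, 0, 0, 0]
result = expansion_moves(start, 3, unary, smooth=1.0)
print(result, energy(result, unary, 1.0))
```

Each accepted move strictly lowers the energy, so the loop terminates; the graph-cut step discussed on the following slides replaces the exhaustive search.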

Minimizing Move Functions using Graph Cuts • Most pairwise CRF models used in Computer Vision lead to submodular move functions • Minimizing a submodular second order pseudo-boolean function is equivalent to an st-mincut on a graph with positive weights • Optimal moves can be found extremely efficiently using graph cuts
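For a term over two binary variables, submodularity reduces to a single inequality, θ(0,0) + θ(1,1) ≤ θ(0,1) + θ(1,0). The check below is a minimal sketch with hypothetical names and made-up cost tables.

```python
def is_submodular_pairwise(theta):
    """theta[a][b] is the cost of assigning (a, b) to two binary variables."""
    return theta[0][0] + theta[1][1] <= theta[0][1] + theta[1][0]


potts = [[0.0, 1.0], [1.0, 0.0]]       # penalises disagreement
anti_potts = [[1.0, 0.0], [0.0, 1.0]]  # penalises agreement

print(is_submodular_pairwise(potts))       # True
print(is_submodular_pairwise(anti_potts))  # False
```

A second order function whose pairwise terms all pass this test can be minimised exactly with an st-mincut.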

Minimizing Higher Order Energy Functions: Our results • Result 1 • We show that a large class of higher order potentials leads to higher order submodular move functions • Can be minimized in polynomial time • Submodular Function Minimization • Minimizing general submodular functions is computationally expensive • Complexity O(n^6) • Cannot handle large problems! (Details in Technical Report)

Minimizing Higher Order Energy Functions: Our results • Minimizing Higher order functions using Graph cuts • Higher order functions can be transformed to second order functions by adding auxiliary variables • Exponential number of auxiliary variables needed in general • Result 2 • Our higher order functions can be transformed to second-order functions using ≤ 2 auxiliary variables per potential • Can be minimized extremely efficiently • Complexity << O(n^6) (Details in Technical Report)
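The idea of trading a higher order term for an auxiliary variable can be illustrated on a simple binary case. The sketch below is not the construction from the technical report; it uses the identity min(γ, θ·Σx_i) = min over m ∈ {0,1} of γ·m + θ·Σx_i·(1 − m), where m is one auxiliary variable, and verifies it by enumeration. All names are hypothetical.

```python
from itertools import product


def truncated_linear(x, theta, gamma):
    """Higher order cost over the whole clique: min(gamma, theta * sum(x))."""
    return min(gamma, theta * sum(x))


def second_order_form(x, m, theta, gamma):
    """Unary terms plus pairwise terms -theta * m * x_i.

    The pairwise coefficients are non-positive for theta >= 0, so every
    pairwise term is submodular.
    """
    return gamma * m + theta * sum(x) - theta * sum(m * xi for xi in x)


def check_equivalence(n, theta, gamma):
    """Check that minimising over m reproduces the higher order cost."""
    for x in product([0, 1], repeat=n):
        reduced = min(second_order_form(x, m, theta, gamma) for m in (0, 1))
        if abs(reduced - truncated_linear(x, theta, gamma)) > 1e-12:
            return False
    return True


print(check_equivalence(5, theta=1.0, gamma=2.5))  # True
```

Setting m = 0 leaves the linear part θ·Σx_i and setting m = 1 leaves the constant γ, so the minimum over m selects the truncation automatically while only unary and pairwise terms remain.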

Overview of our Method: Higher Order Energy = Unary Potentials [Shotton et al. ECCV 2006] + Contrast Sensitive Pairwise Potentials + Higher Order Potentials (Multiple Segmentations) → Energy Minimization → Segmentation Solution

Experimental results. Datasets: MSRC (21) [Shotton et al. ECCV 2006], Sowerby (7) [He et al. CVPR 04]

Qualitative Results (Figure columns: Image (MSRC-21), Pairwise CRF, Higher order CRF, Ground Truth; labels: Grass, Sheep)

Qualitative Results (Contd.) (Figure columns: Image (MSRC-21), Pairwise CRF, Higher order CRF, Ground Truth) Results can be improved using image specific colour models [Rother et al. SIGGRAPH 2004, Shotton et al. ECCV 2006]

Quantitative Results: Problems • Rough ground truth segmentations • Fine structures have small influence on overall pixel accuracy

Generating Accurate Segmentations • Generated accurate segmentations of 27 images • 30 minutes per image (Figure panels: Image (MSRC-21), Original Segmentation, New Segmentation)

Relationship between Qualitative and Quantitative Results (Figure panels: Image (MSRC-21), Pairwise CRF, Higher order CRF, Ground Truth) Overall Pixel Accuracy: Pairwise CRF 95.8%, Higher order CRF 98.7%. Small changes in pixel accuracy can lead to large improvements in segmentation results.

Quantitative Accuracy • Measure accuracy in labelling boundary pixels • Accuracy evaluated in boundary bands of variable width (Figure panels: Image (MSRC-21), Hand-labelled Segmentation, Trimap (8-pixels), Trimap (16-pixels))
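A boundary-band evaluation of this kind can be sketched as follows. This is an illustration under assumptions, not the evaluation code behind the reported numbers: labels are given as 2-D lists, the band is every pixel within `width` pixels (Chebyshev distance) of a ground-truth label change, and the ground truth is assumed to contain at least one boundary.

```python
def boundary_band(truth, width):
    """Pixels within `width` (Chebyshev distance) of a ground-truth label change."""
    h, w = len(truth), len(truth[0])
    edge = set()
    for r in range(h):
        for c in range(w):
            # Compare each pixel with its right and lower neighbour.
            for dr, dc in ((0, 1), (1, 0)):
                rr, cc = r + dr, c + dc
                if rr < h and cc < w and truth[r][c] != truth[rr][cc]:
                    edge.update({(r, c), (rr, cc)})
    return {(r, c) for r in range(h) for c in range(w)
            if any(abs(r - er) <= width and abs(c - ec) <= width
                   for er, ec in edge)}


def band_accuracy(pred, truth, width):
    """Fraction of correctly labelled pixels inside the boundary band."""
    band = boundary_band(truth, width)
    correct = sum(pred[r][c] == truth[r][c] for r, c in band)
    return correct / len(band)


truth = [[0, 0, 0, 1, 1, 1]]
pred = [[0, 0, 1, 1, 1, 1]]   # one mislabelled pixel next to the boundary

print(band_accuracy(pred, truth, 0))  # 0.5
print(band_accuracy(pred, truth, 1))  # 0.75
```

The same prediction scores 5/6 on overall pixel accuracy, which shows why a narrow band is more sensitive to errors near object boundaries.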

Conclusions • Method to enforce label consistency in image regions • Generalization of the commonly used Pairwise CRF model • Allows integration of pixel and region level features for image labelling problems

Thanks

Running Time Results (Plot axes: Time (sec), Number of Segmentations, Inconsistency Cost)

Qualitative Results (Contd.) (Figure columns: Image (MSRC-21), Pairwise CRF, Higher order CRF, Ground Truth)

Transformation to second order functions (Figure: clique c with Auxiliary variables)
