O BJ C UT

UNIVERSITY OF OXFORD OBJ CUT M. Pawan Kumar Philip Torr Andrew Zisserman

Aim • Given an image, to segment the object Object Category Model Segmentation Cow Image Segmented Cow • Segmentation should (ideally) be • shaped like the object e.g. cow-like • obtained efficiently in an unsupervised manner • able to handle self-occlusion

Challenges Intra-Class Shape Variability Intra-Class Appearance Variability Self Occlusion

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Background Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Segmented Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Object Seed Pixels Background Seed Pixels Cow Image

Motivation Magic Wand • Current methods require user intervention • Object and background seed pixels (Boykov and Jolly, ICCV 01) • Bounding Box of object (Rother et al. SIGGRAPH 04) Segmented Image

Motivation • Problem • Manually intensive • Segmentation is not guaranteed to be ‘object-like’ Non Object-like Segmentation

Our Method • Combine object detection with segmentation • Borenstein and Ullman, ECCV ’02 • Leibe and Schiele, BMVC ’03 • Incorporate global shape priors in MRF • Detection provides • Object Localization • Global shape priors • Automatically segments the object • Note our method completely generic • Applicable to any object category model

Outline • Problem Formulation • Form of Shape Prior • Optimization • Results

Problem • Labelling m over the set of pixels D • Shape prior provided by parameter Θ • Energy E (m,Θ) = ∑Φx(D|mx)+Φx(mx|Θ) + ∑ Ψxy(mx,my)+ Φ(D|mx,my) • Unary terms • Likelihood based on colour • Unary potential based on distance from Θ • Pairwise terms • Prior • Contrast term • Find best labelling m* = arg min ∑ wi E (m,Θi) • wi is the weight for sample Θi Unary terms Pairwise terms

MRF • Probability for a labellingconsists of • Likelihood • Unary potential based on colour of pixel • Prior which favours same labels for neighbours (pairwise potentials) mx m(labels) Prior Ψxy(mx,my) my Unary Potential Φx(D|mx) x y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels Φx(D|obj) x … x … Φx(D|bkg) Ψxy(mx,my) y … y … … … … … Prior Likelihood Ratio (Colour)

Example Cow Image Object Seed Pixels Background Seed Pixels Prior Likelihood Ratio (Colour)

Contrast-Dependent MRF • Probability of labelling in addition has • Contrast term which favours boundaries to lie on image edges mx m(labels) my x Contrast Term Φ(D|mx,my) y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels Φx(D|obj) x … x … Φx(D|bkg) Ψxy(mx,my)+ Φ(D|mx,my) y … y … … … … … Prior + Contrast Likelihood Ratio (Colour)

Example Cow Image Object Seed Pixels Background Seed Pixels Prior + Contrast Likelihood Ratio (Colour)

Our Model • Probability of labelling in addition has • Unary potential which depend on distance from Θ (shape parameter) Θ (shape parameter) Unary Potential Φx(mx|Θ) mx m(labels) my Object Category Specific MRF x y D(pixels) Image Plane

Example Cow Image Object Seed Pixels Background Seed Pixels ShapePriorΘ Prior + Contrast Distance from Θ

Example Cow Image Object Seed Pixels Background Seed Pixels ShapePriorΘ Prior + Contrast Likelihood + Distance from Θ

Outline • Problem Formulation • E (m,Θ) = ∑Φx(D|mx)+Φx(mx|Θ) + ∑ Ψxy(mx,my)+ Φ(D|mx,my) • Form of Shape Prior • Optimization • Results

Layered Pictorial Structures (LPS) • Generative model • Composition of parts + spatial layout Layer 2 Spatial Layout (Pairwise Configuration) Layer 1 Parts in Layer 2 can occlude parts in Layer 1

Layered Pictorial Structures (LPS) Cow Instance Layer 2 Transformations Θ1 P(Θ1) = 0.9 Layer 1

Layered Pictorial Structures (LPS) Cow Instance Layer 2 Transformations Θ2 P(Θ2) = 0.8 Layer 1

Layered Pictorial Structures (LPS) Unlikely Instance Layer 2 Transformations Θ3 P(Θ3) = 0.01 Layer 1

LPS for Detection • Learning • Learnt automatically using a set of examples • Detection • Matches LPS to image using Loopy Belief Propagation • Localizes object parts

Optimization • Given image D, find best labelling as m* = arg max p(m|D) • Treat LPS parameter Θas a latent (hidden) variable • EM framework • E : sample the distribution over Θ • M : obtain the labelling m

E-Step • Given initial labelling m’, determine p(Θ|m’,D) • Problem Efficiently sampling from p(Θ|m’,D) • Solution • We develop efficient sum-product Loopy Belief Propagation (LBP) for matching LPS. • Similar to efficient max-product LBP for MAP estimate • Felzenszwalb and Huttenlocher, CVPR ‘04

Results • Different samples localize different parts well. • We cannot use only the MAP estimate of the LPS.

M-Step • Given samples from p(Θ|m’,D), get new labelling mnew • Sample Θiprovides • Object localization to learn RGB distributions of object and background • Shape prior for segmentation • Problem • Maximize expected log likelihood using all samples • To efficiently obtain the new labelling

M-Step w1 = P(Θ1|m’,D) Cow Image Shape Θ1 RGB Histogram for Background RGB Histogram for Object

M-Step w1 = P(Θ1|m’,D) Cow Image Shape Θ1 Θ1 m(labels) Image Plane D(pixels) • Best labelling found efficiently using a Single Graph Cut

Segmentation using Graph Cuts Obj x … y … … … m z … … Bkg

M-Step w2 = P(Θ2|m’,D) Cow Image Shape Θ2 RGB Histogram for Background RGB Histogram for Object

M-Step w2 = P(Θ2|m’,D) Cow Image Shape Θ2 Θ2 m(labels) Image Plane D(pixels) • Best labelling found efficiently using a Single Graph Cut

M-Step Θ1 Θ2 w1 + w2 + …. Image Plane Image Plane m* = arg min ∑ wi E (m,Θi) • Best labelling found efficiently using a Single Graph Cut

Results Using LPS Model for Cow Image Segmentation

Results Using LPS Model for Cow In the absence of a clear boundary between object and background Image Segmentation

Results Using LPS Model for Cow Image Segmentation

Results Using LPS Model for Horse Image Segmentation

Results Image Our Method Leibe and Schiele

Results Shape Shape+Appearance Appearance Without Φx(mx|Θ) Without Φx(D|mx)

Conclusions • New model for introducing global shape prior in MRF • Method of combining detection and segmentation • Efficient LBP for detecting articulated objects • Future Work • Other shape parameters need to be explored • Method needs to be extended to handle multiple visual aspects

O BJ C UT

O BJ C UT

Presentation Transcript

Gvmk,bj .

Gvmk,bj .

LOTO: L ock O ut T ag O ut

o c

O BJ C UT & Pose Cut CVPR 05 ECCV 06

“ Beyond Sounding O ut ”

Keep Sun R ays O ut

Stepping O ut I n Style

Gvmk,bj .

Fifth Graders G o O ut T o Lunch

BJ Bennion

BJ Concepcion

Betty (BJ) Barclay

Bellamy, BJ

bj shoes

O BJ C UT

O BJ C UT

Presentation Transcript

Gvmk,bj .

Gvmk,bj .

LOTO: L ock O ut T ag O ut

o c

O BJ C UT &amp; Pose Cut CVPR 05 ECCV 06

“ Beyond Sounding O ut ”

Keep Sun R ays O ut

Stepping O ut I n Style

Gvmk,bj .

Fifth Graders G o O ut T o Lunch

BJ Bennion

BJ Concepcion

Betty (BJ) Barclay

Bellamy, BJ

bj shoes

O BJ C UT & Pose Cut CVPR 05 ECCV 06