This study aims to automatically construct a generic hierarchical shape model from exemplars. The challenges include dealing with different appearances, ambiguous features, and lack of one-to-one correspondence. The proposed approach involves layered motion segmentations, categorical features matching, and many-to-many graph matching.
Learning Decompositional Shape Models from Examples • Alex Levinshtein, Cristian Sminchisescu, Sven Dickinson • University of Toronto
Hierarchical Models • Manually built hierarchical model proposed by Marr and Nishihara (“Representation and recognition of the spatial organization of three-dimensional shapes”, Proc. of Royal Soc. of London, 1978)
Our goal Automatically construct a generic hierarchical shape model from exemplars • Challenges: • Cannot assume similar appearance among different exemplars • Generic features are highly ambiguous • Generic features may not be in one-to-one correspondence
Layered Motion Segmentations (Kumar, Torr and Zisserman, ICCV 2005) • Models image projection, lighting and motion blur • Models spatial continuity, occlusions, and works over multiple frames (cf. earlier work by Jojic & Frey, CVPR 2001) • Estimates the number of segments, their mattes, layer assignment, appearance, lighting and transformation parameters for each segment • Initialization using loopy BP, refinement using graph cuts
Constellation models Fergus, R., Perona, P., and Zisserman, A., “Object Class Recognition by Unsupervised Scale-Invariant Learning”, CVPR 2003
Categorical features Match
Automatically constructed Hierarchical Models Input: Question: What is it? Output:
Stages of the system • Exemplar images → Extract Blob Graphs → Blob graphs → Match Blob Graphs (many-to-many) → Many-to-many correspondences → Extract Parts / Extract Decomposition Relations / Extract Attachment Relations → Model parts / Model decomposition relations / Model attachment relations → Assemble Final Model
Blob Graph Construction • “On the Representation and Matching of Qualitative Shape at Multiple Scales”, A. Shokoufandeh, S. Dickinson, C. Jonsson, L. Bretzner, and T. Lindeberg, ECCV 2002 • Edges are invariant to articulation • Choose the largest connected component.
Blob Graph Construction Perceptual grouping of blobs: Connectivity measure: max{d1/major(A), d2/major(B)}
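The connectivity measure can be computed directly once its inputs are known. The following is a minimal sketch, assuming d1 and d2 are the two distances marked in the slide's figure (not reproduced here) and major(A), major(B) are the major-axis lengths of blobs A and B. The function name and the sample values are hypothetical, and the slide does not say how the score is thresholded to decide whether two blobs are grouped.

```python
def connectivity(d1: float, d2: float, major_a: float, major_b: float) -> float:
    """Connectivity measure from the slide: max{d1/major(A), d2/major(B)}."""
    if major_a <= 0 or major_b <= 0:
        raise ValueError("major-axis lengths must be positive")
    # Each distance is normalised by the major axis of its own blob,
    # which makes the measure independent of overall image scale.
    return max(d1 / major_a, d2 / major_b)


# Hypothetical values, chosen only to illustrate the call.
score = connectivity(d1=3.0, d2=4.0, major_a=10.0, major_b=8.0)
print(score)
```

Taking the maximum of the two normalised distances means the pair is judged by its weaker side: both blobs have to be close relative to their own size for the score to be small.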
Feature matching One-to-one matching. Rely on shape and context, not appearance! Many-to-many matching
A Many-to-Many Graph Matching Framework 1. Embed graphs with low distortion to yield weighted point distributions. 2. Compute many-to-many correspondences between the two distributions using EMD. 3. The computed flows yield a many-to-many node correspondence between the two graphs. Demirci, Shokoufandeh, Dickinson, Keselman, and Bretzner (ECCV 2004)
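Step 3 of the framework, reading a node correspondence off the computed flows, can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the flow matrix is assumed to come from an EMD solver run on the two embedded point distributions (steps 1 and 2 are not shown), and the function name and the `min_flow` cutoff are hypothetical.

```python
from collections import defaultdict


def correspondences_from_flow(flow, min_flow=1e-9):
    """Turn an EMD flow matrix into a many-to-many node correspondence.

    flow[i][j] is the mass moved from node i of graph 1 to node j of
    graph 2. Any pair with flow above min_flow is treated as matched.
    """
    matches = defaultdict(set)
    for i, row in enumerate(flow):
        for j, mass in enumerate(row):
            if mass > min_flow:
                matches[i].add(j)
    return dict(matches)


# Hypothetical flow: node 0 of graph 1 sends all its mass to node 0 of
# graph 2, while node 1 splits its mass across nodes 1 and 2.
flow = [
    [0.5, 0.0, 0.0],
    [0.0, 0.25, 0.25],
]
print(correspondences_from_flow(flow))
```

Because one source node may send mass to several target nodes (and several sources may feed one target), the resulting mapping is many-to-many by construction, which is what lets a single coarse blob correspond to two finer blobs.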
Feature embedding and EMD Spectral embedding
Returning to our set of inputs • Many-to-many matching of every pair of exemplars.
Part Extraction
Extracting attachment relations • Two parts (e.g., Torso and Right Arm) are attached in the model when the following ratio is high: (number of times blobs drawn from the two clusters were attached) / (number of times blobs from the two clusters co-appeared in an image) • Example: the right arm is typically connected to the torso in the exemplar images.
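One reading of the attachment criterion is a ratio of two counts per pair of part clusters: how often their blobs were attached, divided by how often they co-appeared in an image. The sketch below illustrates that reading under simplifying assumptions: each image is given as a set of cluster labels plus a set of attached label pairs, with at most one blob per cluster per image. The function name, the input layout, and the sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def attachment_scores(images):
    """Return attached / co-appeared ratios for every pair of clusters.

    images: list of (labels, edges) tuples, where labels is the set of
    cluster ids present in an image and edges is a set of frozenset
    pairs of cluster ids whose blobs are attached in that image.
    """
    attached = Counter()
    coappeared = Counter()
    for labels, edges in images:
        for a, b in combinations(sorted(labels), 2):
            pair = frozenset((a, b))
            coappeared[pair] += 1
            if pair in edges:
                attached[pair] += 1
    return {pair: attached[pair] / n for pair, n in coappeared.items()}


# Hypothetical exemplar data for illustration only.
images = [
    ({"torso", "right_arm", "head"},
     {frozenset(("torso", "right_arm")), frozenset(("torso", "head"))}),
    ({"torso", "right_arm"}, {frozenset(("torso", "right_arm"))}),
    ({"torso", "right_arm", "head"}, {frozenset(("torso", "head"))}),
]
scores = attachment_scores(images)
print(scores[frozenset(("torso", "right_arm"))])
```

Normalising by co-appearance rather than by the total number of images keeps a part that is often missing (for example, an occluded arm) from being penalised for the images in which it was never detected.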
Extracting decomposition relations • Example: Left Arm → Upper, Lower
Assemble Final Model
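The final model combines the three extracted outputs: model parts, decomposition relations, and attachment relations. A minimal sketch of a container for them follows. The class name, field layout, and part labels are assumptions made for illustration; the slides do not specify how the authors represent the assembled model.

```python
from dataclasses import dataclass, field


@dataclass
class DecompositionalModel:
    """Hypothetical container for parts and their two kinds of relations."""
    parts: set = field(default_factory=set)
    # parent part -> set of finer-scale child parts
    decomposition: dict = field(default_factory=dict)
    # unordered pairs of parts that are attached to each other
    attachment: set = field(default_factory=set)

    def add_decomposition(self, parent, children):
        self.parts.add(parent)
        self.parts.update(children)
        self.decomposition.setdefault(parent, set()).update(children)

    def add_attachment(self, a, b):
        self.parts.update((a, b))
        self.attachment.add(frozenset((a, b)))


# Relations taken from the examples on the slides; the labels are made up.
model = DecompositionalModel()
model.add_decomposition("left_arm", ["upper_left_arm", "lower_left_arm"])
model.add_attachment("torso", "right_arm")
print(sorted(model.parts))
```

Storing decomposition as a directed parent-to-children map and attachment as unordered pairs mirrors the difference between the two relations: decomposition links levels of abstraction, while attachment links parts at the same level.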
Conclusions • Generic models must be defined at multiple levels of abstraction, as Marr proposed. • Coarse shape features, such as blobs, are highly ambiguous and cannot be matched without contextual constraints. • Moreover, features that exist at different levels of abstraction must be matched many-to-many in the presence of noise. • The many-to-many matching results can be analyzed to yield both the parts and relations of a decompositional model. • Preliminary results indicate that a limited decompositional model can be learned from a set of noisy examples.
Future work • Construct models for objects other than humans – objects with richer decompositional hierarchies. • Automatically learn perceptual grouping relations between blobs from labeled examples. • Develop indexing and matching frameworks for decompositional models.