Object Recognition as Ranking Holistic Figure-Ground Hypotheses

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and CristianSminchisescu

Outline Introduction Method Overview Segment Categorization Segment Post-Processing Experiment Conclusion

Introduction Object detector : Top-down approaches

Introduction Semantic segmentation results produced by our algorithm

Introduction Ideally segment Can model entire object At least sufficiently distinct parts of them

Introduction Constrained Parametric Min Cuts algorithm (CPMC) [6]

Method Overview CPMC Number of segment This paper focus

Method Overview Recognition framework Segment categorization Segment post-processing

Segment categorization Scoring function Sort Combine high-rank segment

Segment post-processing COW

Segment Categorization Multiple Segment and Features Learning Scoring Functions with Regression Learning the Kernel Hyperparameters Compare with Structural SVM

Multiple Segment and Features Model object appearance： Extracted four bag of words of SIFT Two on foreground Two on Background, aim to improve recognition

Multiple Segment and Features Encode shape information： A bag of word of local sharp contexts [2] : measure similarity between shapes Three pyramid HOGs [5] : classifying images by the object categories they contain

Multiple Segment and Features Chi-square kernel ： Computed from each histogram feature and use a weighted sum of such kernel for regression

Learning Scoring Functions with Regression : Image I with ground truth segments : Segmentation algorithm provides a set of segment : Denote the K object categories : Indicator function

Learning Scoring Functions with Regression Quality function : Measure overlap with all denote the value for and is the maximal overlap with ground truth segments belonging to , and do not appear

Learning Scoring Functions with Regression Learn the function for each : use nonlinear SVR(Support Vector Regression) to regress against ,the features extracted from

Learning Scoring Functions with Regression Use kernel trick : : support vector from training set : obtained by the SVR optimizer : maximal score of the segment : final class of the segment

Learning the Kernel Hyperparameters Fundamental equation (3) is infeasible to estimate all kernel hyperparameter via grid search Use subset of data comprised segments that best overlap each ground truth segment

Compare with Structural SVM The structural SVM(in [3]) formulation for sliding window prediction is :

Connections with Structural SVM Our algorithm VS Structural SVM Structural SVM score the bounding box and Our algorithm score the segment Important advantage Guarantee the highest rank for the ground truth Correct ranking for all segment

Segment Post-Processing Simple decision rule : avoid the post-processing and direct choose the segment , cannot detect multiple objects Our methodology : weighted consolidation of segment and sequential interpretation strategy

Segment Post-Processing

Segment Post-Processing To decide which segments to combine Consider segment with intersection > 0.75 for combination

Segment Post-Processing Highest-scoring segment as seed Group segments that intersect it Generated a final mask Proceed with the next higher rank segment Choose segment that are not overlapping with 3

Segment Post-Processing Generate the score for the pixels in the mask by (9), only pixels with score > 0.65 are displayed in the mask.

Experiments Classification : Caltech-101 Detection : ETHZ Shape classes Segmentation : VOC 2009

Classification

Detection

OWT-UCM Masks

Segmentation Bounding box

Conclusion CPMC Categorization Post-processing Achieve good performance Future work : improve the scalability to be able to process hundreds of thousands of image

Object Recognition as Ranking Holistic Figure-Ground Hypotheses

Object Recognition as Ranking Holistic Figure-Ground Hypotheses

Presentation Transcript

OBJECT RECOGNITION

Object recognition

Figure /Ground Relationships

Object/Shape/Figure

Figure/Ground

Figure and Ground

Object Recognition

Figure/Ground Relationships

Object Recognition

Visual Object Recognition

Object Recognition

Visual Object Recognition

Visual Object Recognition

Object recognition

Object Recognition

Object Recognition

Object Recognition

Object recognition

Object Recognition

Object recognition

Object Recognition