Global and Efficient Self-Similarity for Object Classification and Detection

Global and Efficient Self-Similarity for Object Classification and Detection Thomas Deselaers and Vittorio Ferrari CALVIN group Computer Vision Laboratory ETH Zurich Switzerland CVPR 2010

Conventional Image Descriptors Measure direct image properties gradients colors

Self-Similarity vs Conventional Descriptors [Shechtman, Irani CVPR 07] Assumption of conventional image descriptors There is a direct visual property shared by images of objects of the same class (e.g. colors, gradients, …). This property can be used to compare images. Self-similarity: Indirect property: geometric layout of repeating patches within an image More general property

Local Self-Similarity Descriptors [Shechtman, Irani CVPR 07]

Using Local Self-Similarity Descriptors Applications: object recognition, image retrieval, action recognition • Ensemble matching [Shechtman CVPR 07] • Nearest neighbor matching [Boiman CVPR 08] • Bag of local self-similarities [Gehler ICCV09, Vedaldi ICCV09, Hörster ACMM08, Lampert CVPR09, Chatfield ICCV09 WS] Compute LSS descriptors for an image Assign the LSS descriptors to a codebook Represent the image as a histogram of LSS descriptors

Self-Similarity goes Global Capture long-range self-similarities and their spatial arrangement

Global Self-Similarity Tensor compute self-similarity between all pairs of pixels 4D self-similarity tensor Note: local self-similarities included

Problems with the GSS Tensor 11 11 300 500 ∼ 20h ∼ 80GB Aim: Reduce both Computation time: Memory requirement:

Outline • Efficient global self-similarity tensor • Global self-similarity descriptors • Bag of correlation surfaces • Self-similarity hypercubes • Detection with self-similarity hypercubes • Efficient sliding window • Efficient subwindow search • Experiments • Global self-similarity better than local self-similarity • Complementary to conventional descriptors • Object detection possible

Efficient Global Self-Similarity Tensor Find an efficient approximation to If two patches are assigned to the same prototype, they are similar Quantize patches according to codebook Reduces runtime to speedup: 750

Efficient Global Self-Similarity Two patches are only similar if they are assigned to the same prototype Reduces memory to reduction:

Patch Prototype Codebooks Remember: Self-similarity encodes image content indirectly Image-specific codebooks can be smaller than conventional ones see paper for more generic codebooks and extensive evaluation

Global Self-Similarity Descriptors • So far: • Compact GSS computed efficiently • Now: • Descriptors that can be used in machine learning classifiers • Fixed dimensionality • Compact representation • Self-similarity hypercubes: now • Bag of correlation surfaces: only in the paper

Self-Similarity Hybercubes SSH of size

SSHs for Detection • Computing SSH naïvely requires operations • Sliding windows has to evaluate many windows operations

Efficient Computation of SSHs Compute integral self-similarity tensor: can be obtained using 16 lookups in  160000 operations to compute SSH for an image window  ∼5000x speedup

Efficient Subwindow Search for SSH • Derive an upper bound on the score of a set of windows • Section 5.2 in our paper • Similar to [Lampert PAMI09]

Experiments: Object classification PASCAL 07 objects • 9608 cropped images of objects from PASCAL 07 • 20 classes Task: Classify each test image into one of 20 classes Model: Linear SVM Train: train+val Test: test

Classification on the PASCAL 07 objects set classification accuracy [%] + GSS outperform LSS + Self-Similarity is truly complementary to conventional descriptors

Experiments: Object detection e.g. [Ferrari CVPR07, Maji CVPR09] ETHZ Shape Classes • 255 images • 5 classes (apple logos, bottles, giraffes, mugs, swans) Task: Detect objects in images Detector: Linear SVM, sliding windows

Detection Results DR at FPPI 0.4 } bottles giraffes SSH + SSH outperforms BOLSS + it is possible to use GSS for detection with good results apple logos swans mugs DR at 0.5 PASCAL overlap } BoLSS Comparison results (avg): [Ferrari CVPR07]: 71.9 [Maji CVPR09]: 93.2 … many more FPPI 0.4

Runtimes for Computing Descriptors • 200x200 image • GSS tensor • directly: 5512s (∼1.5 hours) • using our method: 81s (∼1.5 minutes) • Computing descriptors: few seconds • Our method: 70x speedup • For Reference: • GIST: 0.4s • BOLSS: 0.7s

Runtimes for Detection June 2014 Given the prototype assignment map (80s) (once only) SSH sliding window: 30s/img (once per class) For Comparison • Computing direct GSS tensor for 25000 windows: 4 years/img Speedup: ∼1 million ⇒ Using our methods, GSS can be used for object detection For Reference: • Felzenszwalb PAMI 09: 5s.

Feasible Global and Efficient Self-Similarity for Object Classification and Detection Thomas Deselaers and Vittorio Ferrari CALVIN group Computer Vision Laboratory ETH Zurich Switzerland CVPR 2010

Conclusion • self-similarity should be considered globally • Global self-similarity performs better than local self-similarity • truly complementary to conventional descriptors • global self-similarity is feasible • efficient computation of self-similarity • two descriptors based on self-similarity • global self-similarity for detection • code will be available soon

Thank you for your attention Thomas Deselaers and Vittorio Ferrari Global and Efficient Self-Similarity for Object Classification and Detection Code will be available http://www.vision.ee.ethz.ch/~calvin

Global and Efficient Self-Similarity for Object Classification and Detection

Global and Efficient Self-Similarity for Object Classification and Detection

Presentation Transcript

Object Detection and Tracking

Self-Similarity

Object detection

An Efficient and Fast Active Contour Model for Salient Object Detection

Object Detection and Recognition

Self - similarity and points of interest

Object Detection

Techniques and Data Structures for Efficient Multimedia Similarity Search

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman

Malware Classification And Detection

Sparselet Models for Efficient Multiclass Object Detection

Efficient Distribution Mining and Classification

Self-Similarity

Measures for Classification and Detection in Steganalysis

Adaboost and Object Detection

3D Object Modelling and Classification

Emotion Classification and Detection

Object detection

Self-Similarity

Self-Similarity: Fractals

Efficient Comment Classification through NLP and Fuzzy Classification