Expert Object Recognition in Video

Expert Object Recognition in Video Matt McEuen

Ventral Visual Pathway EOR Pathway From Draper, Baek, Boody - 2002 The EOR Pathway • Early vision (feature extraction) • Categorization • Exemplar matching

Feature Extraction Clustering Exemplar Matching Expert Object Recognition

Early Vision: Edge Detection • Gabor filters • Three filter sizes • Four orientations • Even and odd

Rectified energy 0° ... Normalized Sum 135° ... Filter output 90° 45° ... Early Vision: Edge Detection

Early Vision: Line Detection • Non-accidental structural properties • collinearity • parallelism • symmetry • Hough transform

Categorization • Allows a unique subspace for each category • K-Means

Alternating optimization: optimize C optimize cluster membership repeat Categorization

Exemplar Matching • Principal Component Analysis (PCA) • Based on covariance • Visual memory reconstruction

PCA • Calculate covariance matrix of the samples • Get the eigenvectors of the covariance matrix • Choose which eigenvectors to keep • Transform the data with the resulting matrix From Moeslund, 2001

64 x 64 Image 5 Element Vector Exemplar Matching

VENUS • Biologically inspired • Habituation • Low-level features

Knowledge and Hierarchical Learning Architecture Knowledge Based on Object Interactions Increasing Semantic Knowledge Real World Knowledge Knowledge Based on Object Activities Knowledge Based on Identified Objects and Context Knowledge Based on Low Level Features

Benefits of EOR for video • General-purpose • Segmentation: Attention window • Associative memory in VENUS

Problems with EOR for video Why learning?

Problems with EOR for video • Hard training / testing distinction • Lots of processing • The parameter k

Data

(2) (1) Segmentation (3—training only) Data (87,79)

Ventral Visual Pathway VEOR EOR Pathway VEOR architecture

Foreground segmentation Ventral Visual Pathway VEOR EOR Pathway VEOR architecture

Foreground segmentation Object tracking Ventral Visual Pathway VEOR EOR Pathway VEOR architecture

Foreground segmentation Object tracking EOR subsystem Ventral Visual Pathway VEOR EOR Pathway VEOR architecture

Foreground segmentation Object tracking EOR subsystem Membership subsystem Ventral Visual Pathway VEOR EOR Pathway VEOR architecture

Foreground segmentation

Ok Too small Y Associate with existing object New object Overlap? Object tracking N

Extract patch, resize Masking Global PCA Feature extraction

Clustering: goals • Automatically determine k • Facilitate learning • ... efficiently

Clustering: solutions • Reuse of cluster centroids

Clustering: solutions • Reuse of cluster centroids • Cluster growing and splitting

Clustering: solutions • Reuse of cluster centroids • Cluster growing and splitting • Incremental clustering

Clustering: Fuzzy K-means

Done (remember centroids) Optimize cluster membership Optimize cluster centroids First time: 4 random cenroids nth time, n>1: saved centroids Yes FKM all clusters small enough? Yes No change < threshold? No Create additional centroid Clustering

Exemplar matching • PCA • Dirty flags • Output: best match & distance

Membership subsystem • Associative memory • Multi-class [0,1] hypothesis • One exemplar match per image • One membership hypothesis per tracked object

Three kinds of exemplar matching • Matching to a training image • Matching to a different learned object • Matching to the same learned object

Exemplar match: training image • Certainty of 1.0 in one class • Small distance == strong match

Exemplar match: different object • Contribution comes from object • Doesn't matter which image

Exemplar match: same object • No “new” evidence • Rebalance existing evidence • Recontribute match's contribution

Learning • Images of familiar classes, familiar views • New views of familiar classes • New classes

Results – Pet Faces Configuration 1 2 3 4 5 6 Accuracy 67.0% 80.8% 84.6% 81.5% 81.7% 83.8% Clustering algorithm HKM HKM FKM FKM FKM FKM Fuzziness n/a n/a 1.1 1.1 1.1 1.3 PCA on raw images? Yes Yes Yes No Yes Yes Hough transform? Yes No No No No No Incremental clustering? No No Yes Yes No Yes

Results – Vehicle Profiles

Results – Video Clips Global dims. Local dims. Accuracy 10 10 64.4% 50 10 82.2% 50 50 82.2%

Results – Video Clips Global dims. Local dims. Accuracy w/o Trucks 10 10 64.4% 50 10 82.2% 100% 50 50 82.2% 96.6%

Questions?

>> n = [3 5 6; 3 3 2; 5 1 2] n = 3 5 6 3 3 2 5 1 2 >> cov(n) ans = 1.3333 -2.0000 -1.3333 -2.0000 4.0000 4.0000 -1.3333 4.0000 5.3333 >> Covariance

Non-eigenvector: Eigenvector: A square matrix of n dimensions has n eigenvectors. Eigenvectors are orthogonal to one another. Eigenvectors

Accuracy Subspace dimensions From Draper, Baek, Boody, 2004 Draper's Results

Expert Object Recognition in Video

Expert Object Recognition in Video

Presentation Transcript

OBJECT RECOGNITION

Object recognition

Dense Object Recognition

Statistical Object Recognition

Object Recognition

Object Recognition

Visual Object Recognition

Object Recognition

Face Recognition in Video

Visual Object Recognition

Visual Object Recognition

Object recognition

Object Recognition

Object Recognition

Object Recognition

Object recognition

Object Recognition

Object recognition

Object Recognition