220 likes | 246 Views
Object class recognition using unsupervised scale-invariant learning. Rob Fergus Pietro Perona Andrew Zisserman. Oxford University California Institute of Technology. Overview. Task: Recognition of object categories. 3 main issues:. Representation Learning Recognition.
E N D
Object class recognition using unsupervised scale-invariant learning Rob Fergus Pietro Perona Andrew Zisserman Oxford University California Institute of Technology
Overview Task: Recognition of object categories 3 main issues: Representation Learning Recognition
Some object categories Learn from just examples Difficulties: • Size variation • Background clutter • Occlusion • Intra-class variation
Model: constellation of Parts • Yuille, 91 • Brunelli & Poggio, 93 • Lades, v.d. Malsburg et al. 93 • Cootes, Lanitis, Taylor et al. 95 • Amit & Geman, 95, 99 • Perona et al. 95, 96, 98, 00 Fischler & Elschlager, 1973
Generative probabilistic model 0.8 0.75 0.9 Foreground model p(A)=PpartsG(Apart|cpart,Vpart) Gaussian shape pdf Gaussian part appearance pdf Gaussian relative scale pdf Log(scale) Prob. of detection Clutter model Clutter pdf Uniform relative scale pdf Gaussian appearance pdf Uniform shape pdf Ppoisson(N|l) Log(scale) p(x)=A-1(uniform) Poission pdf on # detections
Interest Operator • Kadir and Brady's interest operator. • Finds maxima in entropy over scale and location
Representation of appearance c1 c2 Projection onto PCA basis 11x11 patch Normalize c15
Experimental procedure Two series of experiments: Scale variant (using pre-scaled images) Scale invariant • Datasets: • Motorbikes, Faces, Spotted cats, Airplanes, Cars from behind and side • Between 200 and 800 images in size • Training • 50% images • No identifcation of object within image • Testing • 50% images • Simple object present/absent test • ROC equal error rate computed, using background set of images
Motorbikes Equal error rate: 7.5%
Frontal faces Equal error rate: 4.6%
Equal error rate: 9.8% Airplanes
Scale-invariant Spotted cats Equal error rate: 10.0%
Scale-invariant cars Equal error rate: 9.7%
ROC equal error rates Pre-scaled data (identical settings): Scale-invariant learning and recognition:
Discussion Questions • What prior work does this paper build on (besides Kadir’s) ? • What was significant about the training for object vs. background? • What are the main contributions of the Fergus work? • What are the drawbacks to this approach?