Effective Image Database Search via Dimensionality Reduction

Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and HenrikAanæs IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

Outline • Introduction • Methods • LF-clustering • Experiments and Results • Discussion and Conclusion

Introduction • The bag-of-words approach • Feature extraction from the database images • Building the bag-of-words representation • Searching with a query image

Introduction The Bag-of-word Model

Methods Feature representation Clustering Feature assignment Image matching

Feature representation • PCA is applied to reduce the dimensionality of the feature vectors • The reduction of the SIFT descriptor is from 128 to between 3 and 12 dimensions • After dimension reduction we add color to our features • the mean RGB value in a 10 × 10 pixels patch around the localization of each feature

Feature representation is the PCA reduced SIFT feature is the mean RGB values is a weighing parameter ( ) normalized to unit length normalized

Clustering Similar but faster than Mean-shift clustering

Feature assignment [16] D. Nisterand H. Stewenius. Scalable recognition with a vocabulary tree. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 2161–2168, June 2006. • Similarity of images are found by comparing frequency vectors of a query image to images in the database • Give each visual words a weight[16]

Image matching • Frequency vectors are compared using the norm • which is found to be superior to the euclidean distance[16] • norm gives equal weight to the overlapping and non-overlapping parts • Inverted files are used for fast image retrieval

Experiments and Results http://www.vis.uky.edu/~stewe/ukbench/ • Data set • first 1400 images form [16] • a series of 4 images of the same scene • Use three of the images from one scene to train the model and the last for testing • The test result is the percentage of the correct images ranked in top 3 • data set is relatively small

Experiments and Results Data set：

Experiments and Results • Experiments • Color added PCA SIFT • 3, 8, and 12 dimensional PCA SIFT featuresadded features are 6, 11, and 15 dimensions • compare with SIFT features reduced with PCA to 6, 11 and 15 dimensions (without color) • Clustering experiments • LF-clustering • from 8,000 to 12,000 clusters • k-means • 10 clusters in 4 levels resulting in 10,000 clusters

Experiments and Results Results

Discussion and Conclusion • did not apply LF-clustering to the 128 dimensional SIFT features, because it performed very poorly • for future work the model should be tested on a larger set of data • A problem of the design of the bag-of-words model is it static nature • not designed for adding or removing images from the database

Effective Image Database Search via Dimensionality Reduction

Effective Image Database Search via Dimensionality Reduction

Presentation Transcript

Dimensionality reduction

Dimensionality Reduction

Nonlinear Dimensionality Reduction for Hyperspectral Image Classification

Dimensionality reduction

Dimensionality reduction

Nonlinear Dimensionality Reduction for Hyperspectral Image Classification

Dimensionality reduction

Dimensionality reduction

Dimensionality Reduction

Dimensionality Reduction

Dimensionality Reduction

Demystifying Dimensionality Reduction

Dimensionality reduction

Dimensionality reduction

Dimensionality Reduction

Video Analysis via Nonlinear Dimensionality Reduction Technique

Dimensionality reduction

Dimensionality reduction