Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification. IEEE 11th International Conference on Computer Vision (ICCV 2007). Andrea Frome, EECS, UC Berkeley; Yoram Singer, Google, Inc.; Fei Sha, EECS, UC Berkeley; Jitendra Malik, EECS, UC Berkeley
Outline • Introduction • Training step • Testing step • Experiment & Result • Conclusion
What do we do? • Goal • Classify an image into the most appropriate category • Machine learning • Two steps • Training step • Testing step
Flow chart: training • Generate features for each image in the dataset, e.g. SIFT or geometric blur • Compute the distances dji, dki • Input the distances to the SVM for training and solve for W
Choosing features • Dataset: Caltech 101 • Patch-based features • SIFT • Old school • Geometric blur • A notion of blurring • Used to measure similarity between image patches • An extension of Gaussian blur
Triplet • dji is the distance from image j to image i • It is not symmetric, e.g. dji ≠ dij • For a triplet (i, j, k) we want dki > dji
How to compute distance • L2 norm • Image j has m features • dji,1 is the distance for feature 1 of image j, measured against image i • The m values together form the distance vector dji
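The distance-vector computation can be sketched as below. This is a minimal sketch, not the authors' code. It assumes that entry m of dji is the L2 distance from feature m of image j to its nearest feature in image i (the slide only names the L2 norm). The function names and the toy 2-D "features" are hypothetical.

```python
import math

def l2(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def distance_vector(features_j, features_i):
    """Return d_ji: one entry per feature of image j.

    Assumption: entry m is the L2 distance from feature m of image j
    to its nearest feature in image i.
    """
    return [min(l2(f, g) for g in features_i) for f in features_j]

# Toy 2-D "features" (hypothetical values, not real descriptors).
image_j = [[0.0, 0.0], [3.0, 4.0]]
image_i = [[0.0, 1.0]]

d_ji = distance_vector(image_j, image_i)  # two entries, one per feature of j
d_ij = distance_vector(image_i, image_j)  # one entry, so d_ji differs from d_ij
```

Because the vector has one entry per feature of the first image, swapping the two images generally changes both its length and its values, which is the asymmetry noted on the Triplet slide.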
Example • Given 101 categories with 15 images per category • Image j is compared against the 101*15 training images, giving one distance vector per comparison [Figure: image j vs. training data]
Machine learning: SVM • Support Vector Machine • Function: classification and prediction • Supervised learning • Training data are n-dimensional vectors
Example • A survey of men • Annual income • Free time • Has a girlfriend?
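[Figure: each respondent plotted as a vector in a space with axes income and free time]
[Figure: the same plot with the support vectors and the learned model marked]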
SVM standard mathematical expression Trade-off
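The formula on this slide did not survive extraction. For reference, the textbook soft-margin SVM primal is given below. It is shown as the standard form, not necessarily the exact expression on the slide. The constant C is presumably what the "Trade-off" label points at: it balances a wide margin against training errors, which are measured by the slack variables ξ_i.

```latex
\min_{w,\,b,\,\xi}\ \frac{1}{2}\lVert w\rVert^{2} + C\sum_{i=1}^{n}\xi_i
\quad\text{s.t.}\quad y_i\,(w^{\top}x_i + b) \ge 1 - \xi_i,\qquad \xi_i \ge 0 .
```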
In this paper • Goal: learn the weight vector W • W concatenates the weight vectors wj of the 101*15 training images • wj,1 is the weight of feature 1 of image j
How to choose triplets? • Reference image • Good friend: an image in the same class • Bad friend: an image in a different class • Ex: 101 categories, 15 images per category • 14 good friends & 15*100 (1500) bad friends per reference image • 15*101 (1515) reference images • 1515 * 14 * 1500, a total of about 31.8 million triplets
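The triplet count above can be checked with a short sketch. The function names are hypothetical. The brute-force enumeration is only meant for tiny label lists, since the full Caltech 101 setting yields tens of millions of triplets.

```python
from itertools import product

def count_triplets(num_classes, images_per_class):
    """Count (reference, good friend, bad friend) triplets."""
    references = num_classes * images_per_class
    good = images_per_class - 1                  # same class, excluding the reference
    bad = (num_classes - 1) * images_per_class   # every image of another class
    return references * good * bad

def enumerate_triplets(labels):
    """Yield (i, j, k): i is the reference, j shares its class, k does not."""
    n = len(labels)
    for i, j, k in product(range(n), repeat=3):
        if i != j and labels[i] == labels[j] and labels[i] != labels[k]:
            yield (i, j, k)

total = count_triplets(101, 15)  # 1515 * 14 * 1500
small = list(enumerate_triplets(["a", "a", "b", "b"]))
```

With 101 categories and 15 images each, `total` is 31,815,000, which matches the "about 31.8 million" on the slide.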
Mathematical expression (1/2) • Idealistic: • Scaling: • Different: the length of the weight vectors [Figure: a triplet vector padded with zeros]
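One way to read the zero-padded triplet picture is sketched below. Each image has its own number of features, so the per-image weight vectors differ in length. A triplet can still be written as a single vector as long as the concatenated W, with zeros everywhere except in the slots of images j and k. This assumes the image-to-image distance is Dji = <wj, dji>. The slot layout, the function name and the toy data are hypothetical.

```python
def triplet_vector(i, j, k, dist, sizes):
    """Build X_ijk, a vector as long as the concatenated weight vector W.

    dist[(a, b)] is the distance vector d_ab (one entry per feature of image a).
    sizes[a] is the number of features of image a.
    The result is zero except for d_ki in image k's slot and -d_ji in image j's
    slot, so that <W, X_ijk> = <w_k, d_ki> - <w_j, d_ji>.
    """
    offsets, total = {}, 0
    for image in sorted(sizes):
        offsets[image] = total
        total += sizes[image]
    x = [0.0] * total
    for m, value in enumerate(dist[(k, i)]):
        x[offsets[k] + m] = value
    for m, value in enumerate(dist[(j, i)]):
        x[offsets[j] + m] = -value
    return x

# Hypothetical toy data: three images with 2, 1 and 3 features.
sizes = {0: 2, 1: 1, 2: 3}
dist = {(1, 0): [0.5], (2, 0): [2.0, 3.0, 4.0]}
x_012 = triplet_vector(0, 1, 2, dist, sizes)
```

In this layout the reference image i contributes no entries of its own. Only the weights of the good friend j and the bad friend k are touched by the triplet.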
Mathematical expression(2/2) • Empirical loss: • Vector machine:
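The formulas on these two slides were lost in extraction. A plausible reconstruction of the large-margin formulation is given below, and it should be checked against the paper. It assumes Dji = <wj, dji>, with W the concatenation of all wj and X_ijk the zero-padded vector for triplet (i, j, k). The non-negativity constraint on W in particular is an assumption to verify.

```latex
\begin{aligned}
\text{Idealistic:}\quad & D_{ki} > D_{ji}
  \;\Longleftrightarrow\; \langle W, X_{ijk}\rangle > 0 \\
\text{Scaling:}\quad & \langle W, X_{ijk}\rangle \ge 1 \\
\text{Empirical loss:}\quad & \sum_{(i,j,k)} \bigl[\,1 - \langle W, X_{ijk}\rangle\,\bigr]_{+} \\
\text{Vector machine:}\quad & \min_{W \ge 0}\ \frac{1}{2}\lVert W\rVert^{2}
  + C \sum_{(i,j,k)} \bigl[\,1 - \langle W, X_{ijk}\rangle\,\bigr]_{+}
\end{aligned}
```

Here [z]_+ = max(z, 0) is the hinge loss. The scaling step fixes the otherwise arbitrary scale of W by asking for a margin of 1.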
Dual variable • Iterate the dual variables:
Early stopping • Satisfy the KKT (Karush-Kuhn-Tucker) conditions • In mathematics, necessary conditions for a solution of a nonlinear program to be optimal • Threshold • Stop when the dual variable update falls below a threshold value
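The idea of iterating dual variables with a stopping threshold can be sketched with a generic dual coordinate ascent for a hinge-loss problem without a bias term. This is not the authors' exact update rule. In particular it omits any non-negativity constraint on W, and the function name, default parameters and toy data are hypothetical.

```python
def train_dual(xs, C=10.0, tol=1e-6, max_epochs=1000):
    """Dual coordinate ascent for min 0.5*||W||^2 + C * sum(max(0, 1 - <W, x>)).

    Returns (W, alphas, epochs). Stops early once the largest change in any
    dual variable during an epoch falls below tol.
    """
    dim = len(xs[0])
    w = [0.0] * dim
    alphas = [0.0] * len(xs)
    norms = [sum(v * v for v in x) for x in xs]
    epochs = 0
    for epochs in range(1, max_epochs + 1):
        largest = 0.0
        for t, x in enumerate(xs):
            if norms[t] == 0.0:
                continue  # an all-zero vector cannot change W
            margin = sum(wi * xi for wi, xi in zip(w, x))
            # Closed-form step for one dual variable, clipped to [0, C].
            new_alpha = min(max(alphas[t] + (1.0 - margin) / norms[t], 0.0), C)
            delta = new_alpha - alphas[t]
            if delta != 0.0:
                alphas[t] = new_alpha
                w = [wi + delta * xi for wi, xi in zip(w, x)]
            largest = max(largest, abs(delta))
        if largest < tol:
            break  # early stopping: updates have become negligible
    return w, alphas, epochs

# Hypothetical toy "triplet vectors"; each should end up with <W, x> >= 1.
xs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
w, alphas, epochs = train_dual(xs)
```

With millions of triplets, a threshold on the update size avoids spending many extra passes on changes that no longer affect the ranking in any meaningful way.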
Flow chart: testing • Query an image i • Calculate Dxi, where x ranges over all the training data except the query itself • Output the most appropriate category
Query image? • Goal: classify the query image into an appropriate class • The remaining images in the dataset are used as query images
Distance function (1/2) • Query image i • For each training image x, compute the distance vector dxi (dxi,1 is the entry for feature 1) [Figure: image i vs. all the training data, 101*15 distance vectors]
Distance function (2/2) [Figure: image i vs. all the training data, one distance Dji for each of the 101*15 training images j]
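Turning the distance vectors into one number per training image, and then into a ranking, can be sketched as follows. It assumes the image-to-image distance is the weighted sum Dxi = <wx, dxi>, using the weights learned for training image x. The function names, image ids, weights and distances are all hypothetical.

```python
def image_distance(weights_x, d_xi):
    """D_xi as a weighted sum of the entries of the distance vector d_xi."""
    return sum(w * d for w, d in zip(weights_x, d_xi))

def rank_training_images(weights, dist_to_query):
    """Return training image ids sorted by increasing D_xi."""
    scores = {x: image_distance(weights[x], dist_to_query[x]) for x in weights}
    return sorted(scores, key=scores.get)

# Hypothetical learned weights and distance vectors for three training images.
weights = {"cat_1": [1.0, 0.0], "cat_2": [0.5, 0.5], "dog_1": [0.0, 2.0]}
dist_to_query = {"cat_1": [0.2, 9.0], "cat_2": [1.0, 1.0], "dog_1": [0.1, 3.0]}
ranking = rank_training_images(weights, dist_to_query)
```

Each training image applies its own weights, so a feature that is uninformative for one image (weight 0) can still dominate the distance for another.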
How to choose the best image? • Modified 3-NN classifier • If no two images agree on the class within the top 10 • Take the class of the top-ranked image of the 10
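One possible reading of the modified 3-NN rule is sketched below. Start with the three nearest training images. If no two of them agree, widen the window one neighbor at a time up to the top 10. If still no class appears twice, fall back to the class of the top-ranked image. The widening step is an interpretation, not something the slide states. The function name and parameters are hypothetical.

```python
from collections import Counter

def classify(ranked_labels, k=3, limit=10):
    """Pick a class from the labels of training images, nearest first.

    Returns the first class that appears twice as the window grows from the
    top k to the top `limit`; otherwise the class of the top-ranked image.
    Expects a non-empty list.
    """
    top = ranked_labels[:limit]
    for size in range(k, len(top) + 1):
        label, votes = Counter(top[:size]).most_common(1)[0]
        if votes >= 2:
            return label
    return top[0]

agree_in_top3 = classify(["a", "b", "a", "c"])
agree_later = classify(["a", "b", "c", "b"])
no_agreement = classify(list("abcdefghij"))
```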
Experiment & Result • Caltech 101 • Feature • Geometric blur (shape feature) • HSV histograms (color feature) • 5, 10, 15, 20 training images per category
Conclusion • Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification