130 likes | 147 Views
Object Recognition a Machine Translation. Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky. Lexicons. A vocabulary of terms used in a subject A specialized list of terms Devices that predict one representation given another representation. Dataset. Aligned bitext
E N D
Object Recognition a Machine Translation Learning a Lexicon for a Fixed Image Vocabulary Miriam Miklofsky
Lexicons • A vocabulary of terms used in a subject • A specialized list of terms • Devices that predict one representation given another representation
Dataset • Aligned bitext • Annotated images • Images with regions • Unknown which region of image goes with which word from text
Clustering • K means clustering • Vector quantize the image region representation • Kullback-Leibler divergence • Relative entropy • Measure of difference of two probability distributions over the same event space
Evaluation • Auto annotate images • Quantize regions • Use lexicon to determine word • Annotate image with word
Results - Annotation • Base results • 80 words of 371 word vocabulary could be predicted • Retraining • Similar results but some words with higher recall and precision
Results(cont.) • Null probability • Recall decreases • Precision increases • Clustering of like words • Recall values of clusters higher than for single words
Results -Correspondence • Base results • Some good words up to 70% correct prediction • Null prediction • Predict good words with greater probability • Word clustering • Prediction rate generally increases
Evaluation • Human evaluation • Images viewed by hand • Somewhat subjective