Why Categorize in Computer Vision?

Why Categorize in Computer Vision?

Why Use Categories? People love categories!

Why Use Categories? What if we didn’t have categories? Humuhumunukunukuapua'a – “fish that grunts like a pig”

Why Use Categories? Our minds work very intimately with categories • Every common noun in English is a category • Proper nouns name object instances • “this,” “that,” “the,” “my,” “yours,” etc. refer to object instances anonymously

The Categorization Problem

The Categorization Problem Categorization/Classification: Given a set of pre-defined categories, “bin” this image Does not necessarily require object detection Vertical Dimension: • General: “Animal” • Basic: “Bird” • Specific: “Robin”

The Categorization Problem What kinds of categorization are computers good at? • Basic -- especially when using context clues • Specific -- due to low intra-class variation

The Categorization Problem Bad at? • General, due to high intra-class variation and a lack of visual cues

The Categorization Problem Bad at? • Categories defined by non-visual characteristics (like chairs)

Summary • Semantic categories allow humans to convey a large amount of information concisely • We want computers to be able to do the same • What work has been done on this problem? Has it been successful?

Uses of Categorization

Two Examples • Using Context in Categorization • Fine-Grain Object Classification

Caltech 101 (2003) • Dataset for basic-level categorization • Objects from 101 classes • Famously difficult

Categorization with Context Goal: Resolve ambiguity between similar-looking objects of different classes using the semantic context of an object Rabinovich et al. (UC San Diego): Objects in Context First paper to attempt to use context at the object level PASCAL 2007 dataset

Categorization with Context

Categorization with Context Approach • Segment image to preserve some spatial data • Perform Bag-of-Features to give an initial ranked list of labels for each segment • Use a Conditional Random Field (CRF) framework to find agreement between segment labels

Categorization with Context

Bag-of-Features with Segmentation Labeling Segments: Confidence:

Conditional Random Field Way to assign joint probabilities to elements without considering every possible combination in the training set

Conditional Random Field Idea • Given set of segments S, set of labels C • Want to find p(C | S) without knowing p(S) • Associate a special graph with C that obeys the “Markov Property” (uses S) • The ordered pair (S, C) is a CRF conditioned on S

Conditional Random Field

Results

Results False correction

Fine-Grain Classification

Fine-Grain Image Categorization Challenge: need good classifiers that capture detail well

Fine-Grain Image Categorization Yao et al. (Stanford): Combining Randomization and Discrimination for Fine-Grained Image Categorization Approach Random forest with discriminative classifiers This is a kind of machine learning framework that allows us to handle the fine detail in this problem.

Fine-Grain Image Categorization

Random Discriminative Tree Approach • For each tree node, train an SVM classifier for a randomly sampled image region • At each node, make a yes-or-no decision • Uses grayscale SIFT descriptors

Random Discriminative Tree

Results

Conclusion • Semantic categories allow humans to convey a large amount of information concisely • Categorization has been used for basic-level object detection and scene recognition • Fine-grain categorization can provide us with expert-level classification of objects • Not all categories are defined by visual characteristics!

Questions?

Why Categorize in Computer Vision?

Why Categorize in Computer Vision?

Presentation Transcript

3D Vision

Computer Vision: Summary and Anti-summary

Research on Vision and Aging at the UC Berkeley School of Optometry and Vision Science Graduate Group presented by St

CPSC 425: Computer Vision (Jan-April 2007)

Computer Vision: CSE 803

Introduction to Computer Vision

COMPUTER VISION: SOME CLASSICAL PROBLEMS

[ The computer screen and vision ]

Advanced Computer Vision

Hierarchical Models of Vision: Machine Learning/Computer Vision

3D Vision

Name ways we can categorize people in the classroom Example: gender, eye color…etc.

Scalable Learning in Computer Vision

Computer and Robot Vision II

Computer and Robot Vision II

Computer Vision

Projective Geometry for Computer Vision

Comp300a: Introduction to Computer Vision

Computer Vision: Summary and Discussion

Computer Vision: CSE 803

Computer Vision