The Benefits and Challenges of Collecting Richer Object Annotations

The Benefits and Challenges of Collecting Richer Object Annotations Ian Endres, Ali Farhadi, Derek Hoiem, David Forsyth University of Illinois at Urbana-Champaign

What should we say about objects?

What should we say about objects? Mirrors Vehicle Two-wheeled Motorcycle Gas tank Seat Headlight Lic. Plate Motorcycle Facing right On the street Has a rider Tail light Metal Exhaust Rubber Engine Wheel Wheel

How should we annotate objects? • Pascal VOC Motorcycle

Object Polygons • LabelMe • Distinguish object pixels from background • Malisiewicz, Efros BMVC ‘07 Motorcycle

Attributes Motorcycle Two-wheeled Vehicle • Rich descriptions • Recognize unfamiliar objects • Farhadi et al. CVPR’09 • Lampert et al. CVPR‘09 Facing right Has a rider On the street Seat Wheel Headlight Tail light Handlebars Side mirror Exhaust pipe Engine License plate Motorcycle Painted metal Bare metal Rubber Plastic

Localized Parts • Improved models • Part configuration • Better pose/viewpoint • Cross-category part localization: A. Farhadi, I. Endres, D. Hoiem CVPR‘10 Mirrors Gas tank Seat Headlight Lic. Plate Motorcycle Tail light Exhaust Engine Wheel Wheel

Materials Glass Bare Metal Painted Metal Rubber

How should we annotate objects? Is motorcycle Is two-wheeled Is vehicle Facing right On street Has rider Has seat Has wheel Has headlight Has tail light Has handlebars Has side mirror Has exhaust pipe Has engine Has license plate Has painted metal Has bare metal Has rubber +

The CORE datasetCross-category Object REcognition • 2780 Images – from ImageNet • 3192 Objects – 28 Categories • 26695 Parts – 71 types • 30046 Attributes – 34 types • 1052 Material Images – 10 types Download or browse online: http://vision.cs.uiuc.edu/CORE

Outline of Dataset Creation Objects Attributes Images MTurk Quality Control CORE Parts Materials

Collecting Data: Images Unusual Examples Stylized Images Foreground Only Canonical Poses

Collecting Data: Images

Amazon’s Mechanical Turk Turk Accept Submit Task Completed Task Dataset Reject + Resubmit

Collecting Data: Binary Attributes

Collecting Data: Polygons From Alex Sorokin: http://code.google.com/p/cv-web-annotation-toolkit/

Collecting Data: Materials From Alex Sorokin: http://code.google.com/p/cv-web-annotation-toolkit/

Quality Assurance:Grading Multiple annotations If annotations can be compared easily Visual Inspection If inspection is faster than annotation

Quality Assurance:Establishing Trust Qualification Must qualify to complete tasks Higher barrier of entry Good work record Auto-accept all work after 10 tasks and 90% acceptance rate Automatically accept ~50%

Collection Results: Binary Attributes Before 81.4% Agreement 1. Too many options 2. Unnatural tasks 3. Unclear interface 4. Ambiguous options

Collection Results: Binary Attributes Ambiguous Viewpoint • Before • Facing to the right • Facing toward the camera • After • Right side visible • Front side visible

Collection Results: Binary Attributes Before 81.4% Agreement After 90.7% Agreement

Collection Results: Object Polygons • 5% rejection rate Excellent Acceptable Rejected

Collection Results: Part Polygons • 25% rejection rate Excellent Acceptable Rejected

Collection Results: Materials • 22% rejection rate Excellent Acceptable Rejected

Ensuring High Throughput • Provide a large steady stream of work • Attracts more workers, keeps regulars • Quick grading turnaround • Don’t expect results overnight when using United States workers • Do expect results overnight when using global workers • Beware of language issues

How Much Does it Cost?For 100 Images Turk workers spend ~25 hours Annotator Payment + ~10% Turk Commission

Take Home Messages • We need detailed annotations for rich recognition • Mechanical Turk can provide detailed annotations • Interface/instruction design important • State succinctly, simply • Keep Task Size Manageable • Quality Control necessary

Find and Describeunfamiliar objects. Animal Vehicle Four-legged Mammal Head Wheel Leg Can run, jump Is herbivorous Facing right Moves on road Facing right • Come see our poster on Wednesday: • “Attribute-Centric Recognition for Cross-Category Generalization”

The Benefits and Challenges of Collecting Richer Object Annotations

The Benefits and Challenges of Collecting Richer Object Annotations

Presentation Transcript

THE RICHER, THE POORER

BENEFITS OF THE RMP AND CHALLENGES FOR THE FUTURE

The Richer, the Poorer

Benefits and challenges of euro adoption

CHALLENGES IN COLLECTING, COMPILING AND DISSEMINATION OF VITAL STATISTICS

Annotations

Consolidated Audits: The Benefits and Challenges

Annotations and Evaluations

The Generation of formal Annotations

Benefits and Challenges of Services Liberalization

Environmental Benefits and Challenges of Urbanization

The Benefits and Challenges of Rural Nursing

The challenges of collecting and disseminating indigenous data from the coronial system

Richer Michail

Benefits of Collecting Coins from Around the World

The Benefits and Challenges of RPA Implementation

Backpacker: Benefits of Collecting Moments, Not Things.

Benefits and Challenges of Cloud Computing

The Benefits and Challenges of Rifle Silencer Usage

Benefits of Collecting & Comparing Free Moving Quotes