Science-Driven Data with Amazon Mechanical Turk

What is Mechanical Turk Quality 1 4 Science-Driven Data with Amazon Mechanical Turk • Quality doesn’t come for free • If results are blindly accepted, up to 50% are bad. • Options to control quality: • Filter workers: performance history • When a worker has only 50% of tasks accepted, you should expect to reject half of their work. • In “alignment”, requiring 90% accept rate v.s. 25% resulted in 88% accepted v.s. 55%. • Filter workers: qualification • Do qualification test to check that workers understand the instructions. In “attributes”, improved agreement from 65% to 81% (v.s. expert at 84%) • Do automatic verification • In “alignment”, residual distance between landmarks predicts quality very well. 70% of bad submissions are confused with only 5% of good submissions. • Do manual grading. • In most cases manual grading is easy. It is easy to spot a few bad submissions in a large pool of good ones. • Manual evaluation by a trusted worker may be the only way to guarantee high quality. • Request multiple annotations • When the task is opinion-based. E.g. “Select good examples of a category?” • Too difficult to grade. Grading required as much attention as performing the task. • Amazon Mechanical Turk is an example of crowdsourcing. We pay internet users to do small tasks, like segmenting an image. Mechanical Turk is simply a broker of tasks with large community of workers. • Amazon Mechanical Turk at a glance: • We define the task and set the pay • Workers choose to do our tasks • We accept(reject) the work and pay for the accepted work Alexander Sorokin, D.A. Forsyth Department of Computer Science University of Illinois at Urbana-Champaign 3 Volume, Speed and Cost Mechanical Turk proved to be both cheap and scalable: Use cases 2 • Annotate human pose • Align landmarks • Segment images • Annotate visual attributes • Throughput comes with scale • Large number of tasks attracts more workers • Burn-in effect: workers spend time understanding the task • Better pay attracts more workers and ensures higher quality 5 Annotation Toolkit and Future work • We developed a generic tool that easily adapts to different tasks • Simple XML definition of the annotation task • Checkboxes, landmarks, bounding boxes, polygons. • Static images for now • Instructions, source code, etc at • http://vision.cs.uiuc.edu/annotation/ • Future work: • More annotation tools for other projects. Ideas are welcome. • Real-time annotation server for a robot. • “Annotate” Matlab / OpenCV call • True online learning 6 Demo

Science-Driven Data with Amazon Mechanical Turk

Science-Driven Data with Amazon Mechanical Turk

Presentation Transcript

Ms. Turk Science Unit 2 6th

Utility data annotation via Amazon Mechanical Turk

Using Mechanical Turk for linguistic research

Amazon Mechanical Turk New York City Meet Up

Using Amazon Mechanical Turk for Product Term Annotation

Mechanical Turk and AWS Workshop

Data Driven Discovery in Science

Crowdsourcing for NLP Using Amazon Mechanical Turk and CrowdFlower Matteo Negri and Yashar Mehdad

Data-Driven Conversations With Students

Amazon Mechanical Turk Artificial Artificial Intelligence

Mechanical Turk and AWS Workshop

Synchronous Experiments on Mechanical Turk

Amazon Mechanical Turk

Mechanical Turk

Rethinking Grammatical Error Detection and Evaluation with the Amazon Mechanical Turk

Amazon Mechanical Turk ( Mturk )

Building a Persistent Workforce on Mechanical Turk for Multilingual Data Collection

Running Experiments on Mechanical Turk: Day 1

7 TeV data with data driven method

Crowdsourcing with Amazon Mechanical Turk

Digital Preservation in Data-Driven Science

Running Experiments on Mechanical Turk: Day 1