Mechanical Turk for Metadata Collection

There are some fascinating posts over at the ESV blog. ESV refers to the English Standard Version of the Bible.

In order to increase the accuracy of their database of Biblical quotations, they used the Amazon Mechanical Turk. The HIT was relatively simple, and asked the Worker to identify the name of the person who uttered each quote in exchange for a payment of 2 cents. The first set of HITs was uploaded as a test of the speed and quality of the Mechanical Turk workforce. You can read the description of the work here.

A followup post recaps the experiment and describes the results. 3,100 quotations were uploaded using a Perl script. 78 workers responded to their invitation and dove right in. Since this was a test, the folks at ESV already knew the right answer for each HIT. A first-check direct string comparison let them approve 85% of the submissions automatically. Further hand checking pushed the approval rate all the way up to 98.3% -- they rejected just 54 (1.7%) of the submissions.
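The two-stage check described above (automatic approval on a string match, hand review for the rest) could be sketched roughly as follows. This is only an illustration, not the ESV team's actual script, which was written in Perl: the function names, the HIT ids, and the sample answers are all hypothetical, and the lowercase/whitespace normalization is an assumption layered on top of the "direct string comparison" they describe.

```python
def normalize(answer: str) -> str:
    """Lowercase and collapse whitespace (an assumed normalization step)."""
    return " ".join(answer.lower().split())


def triage(submissions, known_answers):
    """Split submissions into auto-approved and needs-hand-review lists.

    submissions: list of (hit_id, worker_answer) pairs
    known_answers: dict mapping hit_id to the expected answer
    """
    approved, review = [], []
    for hit_id, answer in submissions:
        expected = known_answers.get(hit_id)
        if expected is not None and normalize(answer) == normalize(expected):
            approved.append((hit_id, answer))
        else:
            # Anything that does not match exactly goes to a human checker.
            review.append((hit_id, answer))
    return approved, review


# Hypothetical sample data, not taken from the ESV experiment.
known_answers = {"q1": "Moses", "q2": "Paul", "q3": "Jesus"}
submissions = [("q1", "moses "), ("q2", "Peter"), ("q3", "Jesus")]

approved, review = triage(submissions, known_answers)
auto_rate = len(approved) / len(submissions)
print(f"auto-approved {len(approved)} of {len(submissions)} ({auto_rate:.1%})")
```

Submissions that land in the review list are not necessarily wrong; a worker might have entered an alternate spelling or a fuller name, which is presumably why hand checking raised the approval rate well beyond the automatic pass.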

The blog post contains some fascinating statements about the process. Here are some of my favorites:

  • "Computers can’t do everything."
  • "Mechanical Turk presents a new and helpful way to spread the work inexpensively among many people."
  • "We got a database for about $75 that, as far as we can tell, no one has created before for the Bible."
  • "We estimate that Mechanical Turk cut our costs by about 60% for a comparable-quality result."
  • "Workers performed these HITs almost as fast as they were uploaded."

Hard to argue with any of these; we've been talking about the use of Mechanical Turk for quality control, metadata collection, and text annotation for a while now.

-- Jeff;

TrackBack

Listed below are links to weblogs that reference Mechanical Turk for Metadata Collection:

» Mechanical Turk Recap from ESV Bible Blog
Our experiment with Mechanical Turk went better than we expected. We were able to approve 85% of submissions automatically, and we ultimately approved 98.3% of submissions. These figures came in higher than we planned: we thought we would approve 80% ... [Read More]
