Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Transcribable: Free the Files to Go!
#1

The ProPublica Nerd Blog

Secrets for Data Journalists and Newsroom Developers






Transcribable: Free the Files to Go!

by Al Shaw
ProPublica, July 16, 2013, 10:42 am




Today we're releasing a new open source project, which will enable any organization with a DocumentCloud account to do crowdsourcing using documents.

Since we wrapped up our Free the Files project after last year's U.S. election, many people and organizations have asked us how they could build their own web applications like Free the Files to crowdsource their caches of documents. The full Free the Files codebase is undocumented, a bit messy and isn't easy to deploy in environments other than our own, so we decided to extract the salient bits into a Rails plugin we're callingTranscribable.
Transcribable allows you to drop a RubyGem into your Rails app, and instantly add "transcribability" to any attribute on a given model. So, for example, if you have a Filingmodel, and you'd like users to be able to transcribe buyers and amounts, you could write:

class Filing < ActiveRecord::Base transcribable :buyer, :amountendOnce you've defined which details you'd like the crowd to help you find, there is a generator that writes the rest of the code for you, including a beautiful "casino-driven" transcription form with automatically created fields for your attributes. That page looks something like this:
[Image: transcribable-thumb.jpg]
To make sure your crowdsourced data is accurate, Transcribable will "verify" your users' transcriptions by comparing multiple users' answers, and then committing only the agreed upon ones to the master model. Within your model, you can set a threshold over which you'd like users to agree on a filing's attributes, and Transcribable does the rest.
The RubyGem also comes with a few other cool features such as a task that will slurp all the documents in a DocumentCloud project into your database to await transcription. It also lets you specify fields you'd like users to fill out, but not necessarily verify (for things like notes).
To start using Transcribable, just drop it into your Gemfile as you normally would. Instructions for that, and more of the nitty gritty, are available in the documentation on the Github page.
Happy crowdsourcing!
http://www.propublica.org/nerds/item/tra...iles-to-go
"The philosophers have only interpreted the world, in various ways. The point, however, is to change it." Karl Marx

"He would, wouldn't he?" Mandy Rice-Davies. When asked in court whether she knew that Lord Astor had denied having sex with her.

“I think it would be a good idea” Ghandi, when asked about Western Civilisation.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  PRISM/NSA/Etc. -Free Search Engine?.....maybe...they seem to be trying. Peter Lemkin 4 16,064 22-05-2014, 01:21 AM
Last Post: Magda Hassan
  New Free Legal Academic Torrent Site Just Begun Peter Lemkin 3 26,291 08-02-2014, 08:57 AM
Last Post: David Guyatt
  NASA's Massive Free E-Book Collection Magda Hassan 1 36,260 13-08-2013, 12:17 AM
Last Post: Magda Hassan
  The FBI Files - An unofficial Repository of History Magda Hassan 0 12,099 12-08-2013, 01:47 PM
Last Post: Magda Hassan
  Syrian Files Wikileaks Magda Hassan 0 4,274 05-07-2012, 11:21 AM
Last Post: Magda Hassan
  2,619 CIA Sources: The Robert Trumbull Crowley Files Magda Hassan 2 15,492 24-04-2012, 01:38 PM
Last Post: Charles Drago
  Top 9 free vpn services out there Magda Hassan 2 8,444 17-04-2012, 04:13 PM
Last Post: Magda Hassan
  LittleSis* is a free database of who-knows-who at the heights of business and government. Ed Jewett 0 4,040 07-12-2011, 07:16 AM
Last Post: Ed Jewett
  Berlin Crisis CIA Newly Declassified Documents and Free CIA Multimedia DVD-ROM Available Bernice Moore 0 3,800 08-11-2011, 05:31 AM
Last Post: Bernice Moore
  record radio off the web for free and dowload as mp3's Ed Jewett 0 4,140 23-06-2011, 02:19 AM
Last Post: Ed Jewett

Forum Jump:


Users browsing this thread: 1 Guest(s)