| Sun | Mon | Tue | Wed | Thu | Fri | Sat |
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| 8 | 9 | 10 | 11 | 12 | 13 | 14 |
| 15 | 16 | 17 | 18 | 19 | 20 | 21 |
| 22 | 23 | 24 | 25 | 26 | 27 | 28 |
| 29 | 30 |
« Adventures in Identification III: The Indiana Jones of Economics | Main | IN, NC Rehash; WV Prediction »
13 May 2008
I recently came across Datamob.org, a site featuring public datasets and interfaces that have been built to help the public explore them.
From datamob's about page:
Our listings emphasize the connection between data posted by governments and public institutions and the interfaces people are building to explore that data.It's for anyone who's ever looked at a site like MAPLight.org and wondered, "Where did they get their data?" And for anyone who ever looked at THOMAS and thought, "There's got to be a better way to organize this!"
I continue to wonder how the types of interfaces featured on datamob will affect the dissemination of information in society. The dream of a lot of these interface builders is to disintermediate information provision -- ie, to make it possible for citizens to do their own research, produce their own insights, publish their findings on blogs and via data-laden widgets. (We welcomed Fernanda and Martin from Many Eyes, two prominent participants in this movement, earlier this year at our applied stats workshop.) At the same time, the new interfaces make it cheaper for professional analysts -- academics, journalists, consultants -- to access the data and, as they have always done, package it for public consumption. It makes me wonder to what extent the source of our data-backed insights will really change, ie, how much more common will "I was playing around with data on this website and found out that . . . " become relative to "I heard about this study where they found that . . ."?
My hunch is that, just as blogging and internet news has democratized political commentary, the new data resources will make it possible for a new group of relatively uncertified people to become intermediaries for data analysis. (I think FiveThirtyEight is a good example in political polling, although since the site's editor is anonymous I can't be sure.) People will overwhelmingly continue to get data insights as packaged by intermediaries rather than through new interfaces to raw data, but the intermediaries (who will use these new services) will be quicker to use data in making their points, will become much larger in number, and will on average become less credentialed.
Posted by Andy Eggers at May 13, 2008 9:48 AM
Just simple spreadsheets and charts, but I've pulled together some interesting government statistics at: http://www.supportingevidence.com/Government/government_landing.html.
Check out the pareto chart on U.S. Government Agency budgets. It really makes our investment priorities crystal clear.
Scott Gibson
www.supportingevidence.com
'worth a thousand words'
Posted by: Scott Gibson at May 15, 2008 2:16 AM
I think your reference to "relatively uncertified" intermediaries may be missing several points. First, there are many intermediaries of public data in the mainstream media. Are they certified or uncertified, and by what standard? Second, there are many purveyors of public data working in government agencies. Are they automatically to be considered "certified" if they are reporting agency-generated data (even if it often highly selected or even propagandistic)? Third, there are many intermediaries in the commercial world (financial analysts, advertising and sales professionals) who draw on public data to interpret market conditions or make a sales pitch. On what basis can one classify them as certified or uncertified? Fourth, there are many intermediaries in the area of sports statistics -- for example, sabermetricians in baseball such as Bill James and Tom Tango. These are among the most credible people in the statistical analysis of baseball. Virtually none is "certified" in the field but many have sterling reputations among their peers.
Long story short, I don't know what "certified" means or that it is even a useful term. What regulates the quality of the intermediation of public data is critical reading and "the market of ideas" (competitive analysis and reinterpretation of accessible public data). And that doesn't only happen in professional journals with a peer review process. It happens in the blogosphere as well, as analysts develop credibility among their readers (or competing bloggers) for their insightful or persuasive analysis of public data.
Posted by: Will at May 15, 2008 8:53 AM
Scott, your site looks interesting. Have you thought about sharing your work on Many Eyes, either instead of your site or in addition to it? You can post your data, create visualizations, and write commentary on it -- and then get comments from others, even their visualizations of your data. You could then embed your visualizations on SupportingEvidence if you'd like. It just seems better to take advantage of and try to build an existing community (with great tools!) than to be out on your own like that.
Will, you're right, I wasn't clear what I meant by "certified." I think basically I meant "credentialed," in the sense that some institution (university department, think tank, consulting company, media outlet) has bestowed the seal of approval. I definitely did not meant to say that people without those credentials are not credible. My point is that when the cost of accessing data and publishing analysis about it goes down, more of the intermediaries will be people without these credentials. You're right that these new intermediaries develop reputations and ultimately new credentials (e.g. Poblano, blogger for DailyKos) so it's not quite right to say that the new intermediaries are "uncredentialed." I just meant that intermediaries would be coming from new places. And I totally agree with you that the diminishing importance of the old credentials (eg academic journals) is a good thing -- it's progress when the market of ideas is working.
Posted by: Andy Eggers
at May 15, 2008 10:22 AM