Alfredo Covaleda,
Bogota, Colombia
Stephen Guerin,
Santa Fe, New Mexico, USA
James A. Trostle,
Trinity College, Hartford, Connecticut, USA
Here at the IAJ, we have long been a fan — and user — of Prof. Robert Sahr's “Inflation Conversion Factors” web site and tools. We were snoozing at the switch a bit and didn't notice that Sahr updated the site in early June 2009. Check it out: it's filled with both PDF and Excel data/tools to calculate the comparative costs of most anything from 1774 to 2019. The site also includes some fun data:
Selected Government-related Items (number of government employees, Social Security and AFDC/TANF; stamp prices, minimum wage, mean and median family income)
National Government Budget: Outlays (Spending), Revenue, Deficits or Surpluses, and National Debt
This tip comes from our friend Stephan Russ-Mohl, of the European Journalism Observatory.
European Journalism Observatory, September 26, 2009
Many journalists face difficulties in dealing with statistics, and frequently lack the competence to present quantitative information to their publics in easy-to-grasp language.
This is nothing new, as most journalism textbooks contain tips on how to deal adequately with numbers and percentages. Thus far, these remain rules of thumb. Three U.S. researchers – Coy Callison, Rhonda Gibson and Dolf Zillmann – recently tested these rules. Drawing from 240 students who participated in an experiment, their empirical analysis provides new, surprising insights.
The test subjects could deal as easily with percentages as with absolute numbers. Contrary to textbooks, they experienced more difficulties when percentages were characterized verbally. For example, “30 percent of the citizens of XY have AIDS” has a meaning different from “3000 of the citizens of XY have AIDS” – but the percentage is no more difficult to grasp and remember. If, instead, the percentage is transformed into the phrase “Three of ten of the citizens of XY have AIDS,” an additional barrier of comprehension is created. The researchers mention, however, that their experiments should be expanded, and that less educated groups still need to be included.
Coy Callison et al.: How to Report Quantitative Information in News Stories, Newspaper Research Journal, Vol 30/Nr. 2, Spring 2009, 43-55.
We don't know how long this one's been around, but kudos to the gang at the WashPost for taking a concept/tool somewhat on the margin and putting it to good use for both reporters and readers. See http://projects.washingtonpost.com/potus-tracker/
Every day President Obama meets with key members of his administration, Congress, foreign dignitaries, interest groups and regular citizens. Use our interactive database to track how Obama is spending his time, what issues are getting the most attention and who is influencing the debate. Subscribe to daily schedule via RSS.
Nathan, over at FlowingData, has posted a fine example of infographics. The work of Willard C. Brinton is a nice extension of what was being done by U.S. government agencies. Turns out, Brinton's book can be found in used book sites, and at an affordable price.
Someone needs to get me a paper copy of Willard Cope Brinton's Graphic Presentation (1939), because it is awesome.
Brinton discusses various forms of graphic presentation in the 524-page book and what works and what doesn't. There's also some good stuff in there about how to make your graphs, charts, maps, etc (by hand).
The most interesting part is that many of the graphics – despite having no computers in 1939 – look a lot like what we have today. Albeit, they're a little rougher because they're made by hand, but that's just added flavor.
For example, you've got the Sankey diagram above, or a “cosmograph” as Brinton calls it. The instructions read:
One thousand strips of paper are set on edge to represent 100% and are separated into component parts of 100%.
What? You want me to arrange 1,000 strips of paper to make my diagram? Brilliant, I say.
Here are your choropleth maps…
network diagram…
and of course some of your usual suspects…
The entire book is freely available in PDF format, but it's low resolution and takes forever to browse. Michael Stoll has posted some higher quality shots on Flickr.
I still want more though.
Seriously, does anyone know where I can get a copy?
[via Datavisualization.ch]
Like what you see? Subscribe to the FlowingData RSS feed to stay updated on what's new in data visualization.
FlowingData passes along the link to this fine piece of work by Ben Fry. “Ben Fry Visualizes the Evolution of Darwin’s Ideas” Journos could be using a similar approach to analyze the evolution of the ideas of public officials.
“Ben Fry, well-known for Processing and plenty of other data goodness, announced his most recent piece, On the Origin of Species: The Preservation of Favoured Traces, made possible by The Complete Work of Charles Darwin Online.
The visualization explores the evolution of Charles Darwin's theory of, uh, evolution. It began as a less-defined 150,000-word text in the first edition and grew and developed to a 190,000-word theory in the sixth edition.
Watch where the updates in the text occur over time. Chunks are removed, chunks are added, and words are changed. Blocks are color-coded by edition. Roll over blocks to see the text underneath.
As usual, excellent work, Mr. Fry.”
Mary Ellen Bates offers up this good tip on “Google Squared” at Bates Information Services, www.BatesInfo.com/tip.html ________________________________________________________________________________________
Google Labs — the public playground where Google lets users try out new products or services that aren't yet ready for prime time — is my secret weapon for learning about cool new stuff. My favorite new discovery in Google Labs is Google Squared. It's a demonstration of a search engine trying to provide answers instead of just sites, and at a higher level than the simple “smart answers” you see when you search for “time in Rome” or “area code 909”. Rather, Google analyzes the retrieved pages, identifies common elements, and creates a table with the information it has compiled.
This is a fascinating tool that helps you compile facts into tables that Google builds on the fly. Hard to describe, easier to show. Go to Google.com/squared and type in a query that will retrieve a number of similar things — organic farms in Colorado, for example, or women CEOs… even superhero powers.
Google Squared generates a table of facts extracted from its index, with the items you are searching for as the left-most column, along with columns for whatever related characteristics are relevant for the topic. For organic farms in Colorado, for example, the table in the search results has columns for the name of the company, an image from the farm's web site, a snippet of description about the farm, and columns for telephone number, location and “season.” Note that some of these columns may have few entries in them, depending on what information Google analyzed. For women CEOs, the table includes the CEO's name, a photo, a snippet that indicates what her position is, her date of birth, and her nationality. For superhero powers, you will find the superhero's name, a photo, a far-too-brief description of said superhero, the hero's first appearance (in print, that is), publisher and even the hero's “abilities”.
Interestingly, you can insert your own items in a Google Squared table, and either let Google populate the rest of the row or type in whatever content you want in that row. I added Catwoman to my superheroes table and Google filled in the new row with her photo and description; I could provide the rest of the info. For some tables, Google even suggests additional columns. For my superheroes table, I could add columns for Aliases, Alter Ego, Profession (the Joker is a lawyer, of course), and so on. You can add your own columns, as well.
You can also delete a row or column that isn't relevant to your search. If you log in to your Google account, you can save your customized tables for later use. And you can export the table into Excel (the images are exported as URLs).
Google Squared is never going to compete with a real human's analysis of a collection of facts, but it can be a great way to start brainstorming, as a quick way to organize the results of your search, and as a starting point for a nicely-presented deliverable for your client.
“May I publish or reproduce this InfoTip?” Be my guest! Just make sure you credit the source, Bates Information Services, and include the URL, www.BatesInfo.com/tip.html
OK, OK. Using R can be a steep hill to climb for some. But here, thanks to O”Reilly Radar, is a pretty good video of a presentation on using R as a Social Network Analysis tool.
“Social Network Analysis in R — video and slides for talk on doing social network analysis with R.”
Update II: It occurred to me that it would be much better for people to be able to view the entire talk in a single video, rather than having to switch between sections; therefore, I uploaded the whole thing to Vimeo.
Tonight I will be givingOn August 6th I gave a talk at the New York City R Meetup on how to perform social network analysis in R using the igraph package. Below are the slides I will be going over covered during the talk, and all of the code examples from the presentation are available in the ZIA Code Repository in the R folder.
Below is a video of this talk, with a link to the slides I review during the presentation. If you are interested, I suggest downloading the slides and following along with videos while having the slides open, as much of what is on the screen in the video is hard to read.
Social Netowork Analysis in R from Drew Conway on Vimeo.
Andrew Little’s presentation on econometrics in R using Zelig and MatchIt are also available on YouTube starting here. I hope you enjoy the presentation, and please let me know if you have any questions or comments.
Good NYTimes profile of NYU/Hoover Institute professor Bruce Bueno de Mesquita, who has spent 40+ years developing predictive models of socio-political activity. (Also a nice bit of promo for “The Predictioneer’s Game,” Bueno de Mesquita's book scheduled to come out next month.)
“Of course, a somewhat high profile always proves to be an attractor. For example, see “The New Nostradamus.” “Can a fringe branch of mathematics forecast the future? A special adviser to the CIA, Fortune 500 companies, and the U.S. Department of Defense certainly thinks so. “If you listen to Bruce Bueno de Mesquita, and a lot of people don’t, he’ll claim that mathematics can tell you the future. In fact, the professor says that a computer model he built and has perfected over the last 25 years can predict the outcome of virtually any international conflict, provided the basic input is accurate. What’s more, his predictions are alarmingly specific. His fans include at least one current presidential hopeful, a gaggle of Fortune 500 companies, the CIA, and the Department of Defense. Naturally, there is also no shortage of people less fond of his work. “Some people think Bruce is the most brilliant foreign policy analyst there is,” says one colleague. “Others think he’s a quack.”
“Of course, a somewhat high profile always proves to be an attractor. For example, see “The New Nostradamus.”
“Can a fringe branch of mathematics forecast the future? A special adviser to the CIA, Fortune 500 companies, and the U.S. Department of Defense certainly thinks so.
“If you listen to Bruce Bueno de Mesquita, and a lot of people don’t, he’ll claim that mathematics can tell you the future. In fact, the professor says that a computer model he built and has perfected over the last 25 years can predict the outcome of virtually any international conflict, provided the basic input is accurate. What’s more, his predictions are alarmingly specific. His fans include at least one current presidential hopeful, a gaggle of Fortune 500 companies, the CIA, and the Department of Defense. Naturally, there is also no shortage of people less fond of his work. “Some people think Bruce is the most brilliant foreign policy analyst there is,” says one colleague. “Others think he’s a quack.”
Still, we think the articles and approach are well-worth your reading time.
FYI from O'Reilly Radar.
And does this suggest possibility of something like “distributed data analysis” whereby a number of widely scattered watchdogs could be poking into the same data set? If so, raises interesting questions for journalism educators: who is developing the tools to manage such investigations?
Enabling Massively Parallel Mathematics Collaboration — Jon Udell writes about Mike Adams whose WordPress plugin to grok LaTeX formatting of math has enabled a new scale of mathematics collaboration.
http://blog.jonudell.net/2009/07/31/polymath-equals-user-innovatio/
===============================================
Polymath = user innovation Posted by Jon Udell under Uncategorized [9] Comments In February 2007, Mike Adams, who had recently joined Automattic, the company that makes WordPress, decided on a lark to endow all blogs running on WordPress.com with the ability to use LaTeX, the venerable mathematical typesetting language. So I can write this: $latex \pi r^2$ And produce this: When he introduced the feature, Mike wrote: Odd as it may sound, I miss all the equations from my days in grad school, so I decided that what WordPress.com needed most was a hot, niche feature that maybe 17 people would use regularly. A whole lot more than 17 people cared. And some of them, it turns out, are Fields medalists. Back in January, one member of that elite group — Tim Gowers — asked: Is massively collaborative mathematics possible? Since then, as reported by observer/participant Michael Nielsen (1, 2), Tim Gowers, Terence Tao, and a bunch of their peers have been pioneering a massively collaborative approach to solving hard mathematical problems. Reflecting on the outcome of the first polymath experiment, Michael Nielsen wrote: The scope of participation in the project is remarkable. More than 1000 mathematical comments have been written on Gowers’ blog, and the blog of Terry Tao, another mathematician who has taken a leading role in the project. The Polymath wiki has approximately 59 content pages, with 11 registered contributors, and more anonymous contributors. It’s already a remarkable resource on the density Hales-Jewett theorem and related topics. The project timeline shows notable mathematical contributions being made by 23 contributors to date. This was accomplished in seven weeks. Just this week, a polymath blog has emerged to serve as an online home for the further evolution of this approach.
Posted by Jon Udell under Uncategorized [9] Comments
In February 2007, Mike Adams, who had recently joined Automattic, the company that makes WordPress, decided on a lark to endow all blogs running on WordPress.com with the ability to use LaTeX, the venerable mathematical typesetting language. So I can write this:
$latex \pi r^2$
And produce this:
When he introduced the feature, Mike wrote:
Odd as it may sound, I miss all the equations from my days in grad school, so I decided that what WordPress.com needed most was a hot, niche feature that maybe 17 people would use regularly.
A whole lot more than 17 people cared. And some of them, it turns out, are Fields medalists. Back in January, one member of that elite group — Tim Gowers — asked: Is massively collaborative mathematics possible? Since then, as reported by observer/participant Michael Nielsen (1, 2), Tim Gowers, Terence Tao, and a bunch of their peers have been pioneering a massively collaborative approach to solving hard mathematical problems.
Reflecting on the outcome of the first polymath experiment, Michael Nielsen wrote:
The scope of participation in the project is remarkable. More than 1000 mathematical comments have been written on Gowers’ blog, and the blog of Terry Tao, another mathematician who has taken a leading role in the project. The Polymath wiki has approximately 59 content pages, with 11 registered contributors, and more anonymous contributors. It’s already a remarkable resource on the density Hales-Jewett theorem and related topics. The project timeline shows notable mathematical contributions being made by 23 contributors to date. This was accomplished in seven weeks.
Just this week, a polymath blog has emerged to serve as an online home for the further evolution of this approach.
Here's the Amazon link to Few's new book: Simple Visualization Techniques for Quantitative Analysis
Stephen Few is the author of Information Dashboard Design: The Effective Visual Communication of Data, Show Me the Numbers: Designing Tables and Graphs to Enlighten, and the monthly Visual Business Intelligence newsletter. He has worked for more than 25 years as an information technology innovator, teacher, and consultant. As the principal of the consultancy Perceptual Edge, he focuses on practical uses of data visualization to explore, analyze, and present quantitative business information. He lives in Berkeley, California.
Product Description This companion to Show Me the Numbers teaches the fundamental principles and practices of quantitative data analysis. Employing a methodology that is primarily learning by example and “thinking with our eyes,” this manual features graphs and practical analytical techniques that can be applied to a broad range of data analysis tools—including the most commonly used Microsoft Excel. This approach is particularly valuable to those who need to make sense of quantitative business data by discerning meaningful patterns, trends, relationships, and exceptions that reveal business performance, potential problems and opportunities, and hints about the future. It provides practical skills that are useful to managers at all levels and to those interested in keeping a keen eye on their business.