Alfredo Covaleda,
Bogota, Colombia
Stephen Guerin,
Santa Fe, New Mexico, USA
James A. Trostle,
Trinity College, Hartford, Connecticut, USA
Here's one of those online sites that will keep us browsing for hours. “Information Aesthetics” weblog says it's about “form follows data – towards creative information visualiztion.” Indeed so. How about links to:
The principles are here showing how creative journalism might deliver pertinent data/information to the people. Information Aesthetics is updated often.
James Fallows column in Sunday's NYT discusses some of the frustration with keyword searching and the El Dorado of having search engines “just answer my question.” Fallows points specifically to work to develop Aquaint. The CIA, NSA and similar federal organizations are apparently quite interested in the approach initially developed at Stanford University's Knowledge Systems Lab. Of deeper interest to serious researchers (or search-tool forecasters) than Fallows' column might be the lab's research papers.
There were multiple sessions at last week's IRE convention related to online research methods and tools, reflecting the constantly dynamic nature of that activity for journos.
We recently were referred to RDN's “Virtual Training Site.” It's mission: “The Internet is a rich source of information for students, lecturers and researchers. The RDN Virtual Training Suite tutorials teach the key information skills for the Internet environment. Learn how to use the Internet to help with your coursework, literature searching, teaching and research.“ The site's organization is uncommonly arranged by topic and academic discipline instead of search engines. While there is no category for journalism, per se, many of the disciplines we utilize are there and worth a look. There are some fine tools here for educators, both in the classroom and the newsroom.
Floyd J. McKay, a journalism professor emeritus at Western Washington University, and a regular contributor to the Seattle Times editorial pages, suggests that today's journalism students lack the right stuff to do difficult reporting. In “The hardscrabble roots of investigative journalism,” he says: “Journalism students, at least in my experience, are less interested in hard-scrabble reporting and more interested in supporting roles.” He also says: “…The cost of uncovering a big story can be stupendous, often involving lawyers and computer experts as well as reporters, photographers and editors.
Most papers would rather spend the money on airplane tickets to cover their region's NFL or NBA teams, or so entertainment writers can make pilgrimages to Hollywood. These investments are more likely to attract readers, which in turn attract advertising dollars. The intensely bottom-line newspaper chains rarely appear on the honor roll, but always appear at the top of the profit-margin charts.
More of these investigative awards are won through the use of computer-assisted reporting, often involving the use of complex databases. A prize-winning team typically includes at least one journalist who specializes in this work, and often another who specializes in displaying the product graphically.”
It's good to see the word “taxonomy” creeping into the newsroom. And the AP is looking for someone who can make them. Here's the job posting: TAXONOMY DEVELOPER The Associated Press New York, NY
In a rapidly evolving technological environment, the Taxonomy Developer will collaborate with journalists, technologists, product specialists and news librarians to coordinate taxonomy creation, development and maintenance across media types and products, with the goal of aiding in the efficient retrieval and distribution of information.
The Taxonomy Developer for the Associated Press will develop taxonomies as well as create the taxonomy management and implementation strategy for AP's content delivery.
Responsibilities The taxonomy developer will help define overall AP Taxonomy Integration Strategy for content classification, delivery and user experience; work with Subject Matter Experts (SMEs) on editorial, technical and product teams to develop taxonomy implementation, process and management strategy; and help evaluate and work with appropriate tools for taxonomy management, data collection/analysis and surfacing of new terminology.
In addition, duties will include selection and prioritization of appropriate taxonomy domains. This includes developing taxonomies for new and existing products; selecting allowed values lists for proper names, products and companies; creating extensions and qualifiers to integrate AP's taxonomic scheme to external standards (ISO, SIC/NAICS, etc.); and working with and extending NewsML, IPTC News Codes and NITF. This person will work closely with the editorial, technical and product teams ensuring the taxonomies are usable and will develop and manage automated, semi-automated and manual processes for gathering taxonomy data, including adding terms, synonyms, aliases and new relation types as needed.
The Taxonomy Developer will work as part of a dynamic, multi-disciplinary team that is creating multimedia news and information products for AP and bringing them to market.
Qualifications include: 1) familiarity with industry standards groups, such as ISO, SIC/NAIC, 2) understanding structural metadata standards for content classes and entity extraction, 3) ability to validate usability of taxonomies with internal user groups (editorial teams) as well as external audiences, 4) expertise with taxonomy management and data collection/analysis, 5) surfacing of new terminology, 6) familiarity with Search and Auto Classification tools (Autonomy http://www.autonomy.com/content/home/ and Teragram http://www.teragram.com/); Text extraction tools (InXight http://www.inxight.com/); Taxonomy/Ontology maintenance tools (SchemaLogic http://www.schemalogic.com/ and Teragram http://www.teragram.com/)
MLIS degree or 3 years experience preferred.
For consideration, please send cover letters and resumes to taxonomy@apjobs.org
The Associated Press is an Affirmative Action/Equal Opportunity Employer.
Interesting article on The Virtual Chase, a web site dedicated to “teaching legal professionals how to do research.” For the details, see “”How To Conduct a Background Check.”
Nils Mulvad, one of the early champions of analytic journalism in Europe and founder of the Danish Institute for Analytic Reporting, demo-ed a fast web-scrapping tool at the IRE conference this week. Web-scrapping? It’s a way to get just the data you need from a web site that has a dynamic search engine. The FECinfo site is an example: the user enters the search terms and the site’s server returns the desired results.
As a one-off, that works OK. But what if you need all the data on the server? Turn to “RoboSuite.” It’s a point-and-shoot, build-your-own-script application. A good PERL coder can do the same thing, of course, but if you can afford it, RoboSuite is a fast solution to data harvesting.
Paul Walmsley, a programming wiz at IRE, has developed a neat PERL script for doing a bit of Social Network Analysis online at the IRE site.
“JustLooking” is a members-only tool that has been up for a year, Walmsley said, but lacking publicity, it’s been pretty much backstage. The app is a relatively basic, yet impressive tool whose results are designed to be integrated/imported into UCInet, an early SNA tool.
“JustLooking” comes, so far, with two network templates to save time in common situations. * Campaign Finance: for tracking campaign dollars * Rolodex: for entering basic networks of people and organizations
Dig out your IRE membership number and check it out.
One of the interesting challenges for journalists and public health professionals is figuring out how to compare, and visualize, health care statistics in a demographic and geographic environment. Yeah, that's one of the things that epidemologists are supposed to do every day. But it ain't easy. In the current issue of ArcUser, Chakib Battioui, of the University of Louisville, Kentucky, has written an interesting article on “Calculating Health Disparity Indexes.” “Socioeconomic indexes are strongly believed to be associated with the risk of disease. However, there is no consensus in the United States regarding which area-based measure should be used to assess socioeconomic inequalities in health…. “To study the relationship between the rate of cervical cancer and economic status, the project used the Socio Economic Risk Index (SERI). SERI classifies people in public databases based on residential neighborhood characteristics and permits the calculation of population-based rates stratified by location…. “There are technical and conceptual obstacles to the adoption of area-based measures for public health. Currently, there is no consensus in the United States regarding which area-based measures should be used and what level of geography should be used to measure or monitor socioeconomic inequalities in health.”
The article is worth checking out because of the methodology's potential for application to other types of data.
Better Access to Public Health Infomation The same issue of ArcUser also carries an article by our old friend Bill Davenhall, of ESRI. His topic is as broad as the sub-hed above, but the accompanying map is especially interesting. Its caption: “Facing a flu vaccine shortage for the 2004-2005 flu season, Nebraska public health officials rapidly determined both the current vaccine supply and the anticipated demand using GIS.”
We're told that there might well be another flu vaccine shortage this coming winter. Heads up journos are starting to think now about how to cover — and illustrate — THAT story.
From the good ol' Librarians' Index to the Internet comes a good site/toolbox for learning and teaching stats. “The Claremont Colleges' “Web Interface for Statistics Education” (WISE) seeks to expand teaching resources offered through Introductory Statistics courses, especially in the social sciences. This project aims to develop an on-line teaching tool to take advantage of the unique hypertextual and presentational benefits of the World Wide Web (WWW). This teaching tool's primary application is as a supplement to traditional teaching materials, addressing specific topics that instructors have difficulty in presenting using traditional classroom technologies. The tool serves to promote self-paced learning and to provide a means for advanced students to review concepts.”