Matt Zeitlin is
worried about the future of journalism:
It’s widely known that lots of journalists are innumerate. And while a lot of people are innumerate, and journalists are just that, people, it becomes a problem when it comes to reporting economic news and data. A popular mistake is reporting nominal dollar figures instead of inflation adjusted figures when comparing wages or sales in two given years. Or you get the horrible mangling of simple statistical comments like significance or margin of error. The point is, journalists need to know math! Or at least some very basic statistics and economics.
One problem, I feel, is one of cultures. Most journalists studied humanities in college, and best I can tell, people who study the humanities are largely the same people who say “Thank God I don’t have to take any math classes again.” And while you don’t actually need to know any college math to be a good journalist, you really do need to know some statistics, or more generally, quantitative methods.
At Medill, Northwestern’s Journalism school, they require students to take some sort of Math for journalists class. And, if a friend of mine’s facebook status — since when does journalism involve math? seriously? — is any indication, there is still a lot more work to do.
My feelings exactly. Most of the time, I feel compelled to restrain myself in commenting on the matter, since I risk sounding like the obnoxious math geek who tells English students they're just not smart enough to take real classes. But now that Matt Zeitlin (a bona fide philosophy major) is talking about it, I might as well enter the fray.
He hits the nail on the head by pointing out that
college math isn't necessary to perform competently as a journalist. Most
high school math isn't important, for that matter. You don't need to remember the quadratic formula, know how to divide complex numbers, or understand whatever the hell the law of cosines is. If you started disliking math because all the symbols and equations scared you, it doesn't matter: as long as you can grasp basic quantitative concepts, you can be a perfectly capable journalist.
What do you need to learn? Getting confused between real and nominal statistics, as Matt mentions, is indefensible. You'd think that this would be the first piece of training given to anyone reporting economic news:
whatever you do, don't say that wages have increased because inflation has increased their nominal value! The difference between a mean and median isn't too difficult either. A lot of smart young bloggers that aren't exactly math geeks have already figured this out -- why can't writers at leading national newspapers?
In the spirit of progress, I therefore present you with the first installment of Remedial Statistics for Journalists. It's hardly complete, and might be better titled A Random Collection of Matt's Pet Peeves, but it's good to let out.
Polling.
If a poll's margin of error is 5% and Obama polls 52%, this result is not "statistically indistinguishable from a tie." You'd think that this would be evident using a little basic logic: if 52% is statistically "indistinguishable" from 53%, which is in turn the same as 54%, and so on, transitivity implies that 52% is indistinguishable from 100%, which is clearly false. Alas, journalists' desire to display what they perceive to be statistical sophistication gets ahead of them here, as a snappy-sounding declaration that two numbers are "statistically indistinguishable" impresses the editor enough to get by. Yes, 52% is very close to 50%, and there's a substantial chance that the real figure is below 50% -- but even with this very narrow margin, it's more likely that the actual figure is above 50%, meaning that the poll result is hardly "equivalent" to a tie.
Indeed, the "margin of error" is nothing special at all. It's based on the arbitrary choice of a 95% confidence interval, which in turn only deals with easy-to-quantify sampling error. That's the error you get when you select 1000 voters, in a perfectly random way, from the population; even if your method of selection is perfect, there's inevitably some error resulting from the fact that you haven't sampled the entire population. As it happens, this isn't the most serious error you get in polling, because it can be aggregated away: the average of 4 polls, each with a 5% margin of error, no longer has a 5% margin of error. (Journalists tend to be obtuse here as well, and often talk about how averages of many polls are within the "margin of error," even when the margin of sampling error is much smaller in the aggregated sample.)
The real danger in polling is that the method for selecting the sample is flawed, a danger that lies outside the "margin of error" concept altogether. Maybe you're oversampling elderly white women, or disaffected former construction workers. Indeed, pollsters find this to be such a problem that they implement ad-hoc demographic "weighting" procedures to correct for it. But even if you've pinpointed the percentages of every racial, gender, and age category in the voting pool -- no mean feat, because these percentages fluctuate from year to year -- you're left with the possibility that you'll systematically oversample voters of a particular political persuasion, even if you've compensated using all the obvious demographic markers. In this sense, the margin of sampling error (the one you see on press releases) is actually a lower bound on the true margin of error, and depending on the pollster can be a very serious underestimate indeed.
Hopefully that's been a nice lesson -- unfortunately, statistics is about a lot more than just polling. Many times, you'll end up reporting how "researchers have found" that hugging seals causes chronic obesity, or some other amusing nugget from the world of science that makes good copy. How do these mysterious "researchers" come to such conclusions? Well, one way is...
Randomized Experiments
When they work, randomized, controlled experiments are a great way to identify differences and causal effects. Want to know whether a drug actually helps patients with hypertension? Simple: give some subjects the actual drug, some subjects the placebo, and see what happens. After I confidently declared that there was no real difference between Busch and Bud Light, a few of my friends and I tried this last week, and our blind, randomized evaluation found that unless you're much better at tasting cheap beer than we are, the extra $5 for a 24-pack is completely useless. (Statistics in action!)
Alas, randomized experiments aren't the answer to everything. Their results might be internally consistent, but often they only tell you that a smattering of coerced psychology undergrads pressed a red button after being told to think about trucks. You need to know whether the participants are a representative cross-section of the population under study, whether the sample size is large enough to justify bold claims about its implications, and whether the experimental design introduces any biases into the research.
Sometimes experiments are great (medicine). Sometimes they're shaky (psychology). Sometimes they barely even apply (economics). Understand the difference.
Regression
This is researchers' main tool to draw statistical conclusions when they can't run a randomized experiment. The idea is to measure how a "dependent variable" is influenced by a number of "independent variables." The simplest and most commonly used assumption is that the effects are linear: that, say, an increase of 1 in X is associated with an increase of 2 in Y. A standard technique, called "ordinary least squares," allows you to place a number on each effect. Never mind how it works mathematically -- it's not too complicated, but software packages do all the work anyway, and any formulas are likely to distract you.
The key is to understand the limitations of regression analysis. First, it doesn't magically overcome the fact that correlation is not causation. If you regress income on education, you don't know whether education is causing people to earn more (probably), better financial circumstances allow people to educate themselves longer (probably, albeit to a much lesser degree), or whether they're both tied up in a swirl of complicated demographic and social factors.
Any variable that's relevant but not included in a regression equation can distort the results. Does increased time spent playing video games actually result in lower grades, or is heavy gaming just an indicator of low academic motivation (something that's fuzzy and difficult to measure in a study)? You can try to add more variables to the model, "controlling" for important factors, but omitted variables will inevitably still present a problem, and sometimes the overzealous addition of controls can actually
make the problem worse.
Tons of researchers run regressions without understanding what they really mean, or how problems like omitted variable bias or endogeneity (a fancy way of saying that causation runs in multiple directions) can distort the results. Ask yourself: does this study's approach really prove that X causes Y? If the researchers only claim that two variables "correspond" or "are associated," don't oversell their findings, and if they claim causation, apply a little critical thinking to see how credible that claim is. Sometimes these problems are unavoidable, and we have to rely on whatever flawed research we have, but there's a lot of nonsense spread by journalists' overeager pens.