Do More Unequal Places Tend to Vote for Democrats?
David Frum had a great article in last week’s NYT Magazine discussing the political impacts of economic inequality. This is a subject in which I have a longstanding interest.
David Frum argued, in part, that:
As a general rule, the more unequal a place is, the more Democratic; the more equal, the more Republican.
He and Andrew Gelman have gone back and forth on this and other points in the article. In the course of this exchange, David Frum has said that he thinks it’s more useful to consider inequality vs. partisan voting at the county level rather than the state level. That is, that one should most usefully define “place” in the quoted sentence as “county” rather than “state”. This question seemed to me to amenable to empirical resolution, so I decided to see if I could take a quick look at it.
Data
I was able to find 2007 Gini coefficients, a standard inequality metric, produced by the Census for about 800 large US counties. But it occurred to me that if you read David Frum’s article, he is describing neighborhood effects rather than just individual effects. That is, if you had two equivalently unequal populations, one of which had the same (unequal) distribution of incomes in every neighborhood, and the other of which had a few very rich neighborhoods and lots of poorer neighborhoods, these two hypothetical societies might tend to vote very differently.
So I created my own additional inequality metric. The Census reports the average per capita income by Census Block Group (CBG). A CBG is a small geographic unit containing something like 1,000 – 2,000 people. There are about 211,000 CBGs in the U.S. There are about 3,000 counties or county-like political jurisdictions in the U.S. (using what the Census terms a FIPS number). So there are about 65 or so CBGs per county equivalent. By running a whole lot of spatial queries, I mapped each of the 211,000 CBGs to a FIPS county equivalent, thereby creating an average of about 65 point-estimates for average neighborhood per capita income within each county equivalent. I then calculated the standard deviation of the average 2007 per captia income across the CBGs that approximately comprise each county, and used this as an additional metric for inequality by county. This was designed to test for the kind of neighborhood effect that I referenced, but also has the advantage of providing an inequality metric that is uniformly defined for all counties, not just large ones. I calculated this same metric for the year 2000.
I used the percentage of the vote received by Bush in the 2004 election as the base measurement of Republican vs. Democrat, and used the change in the Bush vote between 2000 and 2004 as the measurement for change in Republican vs. Democrat.
Analysis
In summary, more unequal counties tend to be less Republican in recent elections.
This finding is robust against time periods, samples of counties and definitions of inequality. Examined across the large counties for which I had both Gini coefficients and the neighborhood inequality index, inequality is negatively correlated with Bush’s vote percentage in 2004 and 2000 at a significant level with both indices. This same result holds for the full slate of about 3,000 counties using the neighborhood inequality index. I calculated the within-state relationship for the counties within each of the 48 lower states, and 46 show this same directional finding. (Professor Gelman will not be surprised to learn that Connecticut and New Hampshire are the outliers.)
The neighborhood inequality index is somewhat more correlated with voting outcomes across the large county sample than are the Gini coefficients. Interestingly, the correlation between these two inequality metrics is only about 0.6, and when combined each provides incremental information. In order to keep things simple, I created a hybrid index by normalizing and adding the metrics, which performs better than either one in isolation with very high significance. Here is the graphic:

When I examined the change in inequality 2000 – 2007 vs. the change in vote 2000 – 2004, I found no significant relationship between change in inequality and change in vote in any sample.
Interpretation
Based on this analysis, David Frum is correct that unequal places are less likely to vote for Republicans at the national level. To bound this, however, I should emphasize that this analysis neither supports nor refutes any assertions about causality. Inequality may cause changes in voting behavior, but it is certainly entangled with many other factors.
Trying to build some kind of a cross-sectional model that “holds all other factors equal” is almost certainly a fool’s errand. Interactions between drivers are a central, not a peripheral, component of such a complex social phenomenon. This complexity would overwhelm 3,000 data points pretty quickly. (Professor Gelman, who is one of the best statisticians in America, is acutely aware of this generic issue.)
If anything, the observation that changes in inequality don’t correlate with changes in voting tends to undercut the argument for (simple) causality, though obviously this was pretty crude analysis – I didn’t even have well-aligned time periods, didn’t consider possible lag or confounding effects and so on.
Ideally we would have some kind of structured experiment to establish causality, but it’s pretty hard to see that happening. Barring that, the next best solution would be natural experiments (e.g., look at location-periods with a sudden immigration spike because of some weird legislative change, etc.), but even there it’s hard to see how you would not have deep confounding. Even so, that seems to me to be best bet for some way to disentangle effects.
“By running a whole lot of spatial queries, I mapped each of the 211,000 CBGs to a FIPS county equivalent…”
Sounds like a lot of work. Your block group attribute table didn’t have a STCOFIPS attribute?
— Matt Frost · Sep 15, 06:38 PM · #
Matt:
LOL…that would have made that step a lot easier!
— Jim Manzi · Sep 15, 07:19 PM · #
I shudder to think of the sliver polygons…
— Matt Frost · Sep 15, 07:56 PM · #
Neat. I know the cross-correlation would be a major problem, but if you throw ln(median(income)) for counties in the explanatory variables, how the does the regression fare? I understand you not even wanting to open that can of worms, but I can’t figure out which way it would bias a result, and it would be interesting to see it with some control on the first moment of income rather than just the 2nd (smart on including std!) and 3rd.
Also – what does a Manzi Inequality measure of 0 correspond to? A 5? The results look consistent through the subsets – nice results.
— rortybomb · Sep 15, 08:14 PM · #
It seems an intuitive point, though, doesn’t it? More inequality = more people at the bottom who have an interest in government redistribution, as well as more rich people who can afford to feel guilty enough to support moderate redistribution.
— Stuart Buck · Sep 15, 09:40 PM · #
Stuart, some “intuitive points” are right but a lot of them are wrong. The great thing that Jim has done here is to test the intuition to find out how accurate it really is.
Also, there one sentence here— “Interactions between drivers are a central, not a peripheral, component of such a complex social phenomenon” — that I wish I could have branded on the foreheads of a few dozen social commentators.
And finally, I know I’m in another world than my usual one when a question like “Your block group attribute table didn’t have a STCOFIPS attribute?” get an LOL response. Yeah, I know what you mean, those crazy STCOFIPS attributes totally crack me up, man!
— Alan Jacobs · Sep 15, 11:36 PM · #
Alan, I’m not questioning the usefulness of the post. Just offering an observation that might be relevant to the causality point . . . even if it’s impossible to determine empirically, it makes intuitive sense that people who live in places of extreme inequality would then become more attuned to redistributionist arguments.
— Stuart Buck · Sep 15, 11:52 PM · #
Not as rigorous in methodology, but interesting to look at and in similar vein:
http://www.amconmag.com/article/2008/feb/11/00016/
— TCO · Sep 16, 12:48 AM · #
This analysis is not spatially valid. You cannot attribute the same GINI coefficient, or any measured value, from two different spatial units, in this case counties to BGs. For example, Los Angeles County , CA has MANY BGs, and income inequality that varies a great deal spatially within the county. Are you saying that all block groups in LA County have the same GINI? This is the problem when amateurs play with real weapons. The interpretation might, or might not, be valid, but your analysis sheds no light on it whatsoever.
— GIS_expert · Sep 16, 01:03 AM · #
@GIS_expert: if I am reading this correctly, the author did not attribute the gini coefficient of the county to the BG’s. Rather, he took the standard deviation of median income for all BG’s within a county to form an independent measure of inequality for the county. Maybe you would have been more tolerant of good practice if you weren’t so quick to show off your expertise in spatial statistics.
— gis_amateur · Sep 16, 01:27 AM · #
Inequality is just a byproduct of what really drives an area Republican or Democratic: the affordability of family formation. Republican family values appeals resound more in areas where people can more afford a house with a yard in a decent school district. Those are typically in inland metropolitan areas where suburbs can expand 360 degrees — i.e., suburban expansion isn’t blocked by ocean or Great Lake.
That’s why Bush carried 25 of the 26 states with the highest white total fertility rate in 2004 and Kerry won the bottom 16.
As somebody above noted, you can see how it all works here:
http://www.amconmag.com/article/2008/feb/11/00016/
By the way, there basically wasn’t any change between 2000 and 2004 — Bush just did 0 to 6 points better in practically every demographic group, state, and even county. The strength of Affordable Family Formation was equally great in both elections. The 1992 and 1996 elections aren’t really usable because Perot messed things up for red vs. blue analysis. So the 1988 election is the latest previous election useful for trend analysis. Affordable Family Formation was important in 1988, but had become significantly more important by 2000-2004.
— Steve Sailer · Sep 16, 01:43 AM · #
I would think that income inequality is a good proxy for urbanization, similar to the point suggested by Steve Sailer above. Would it be possible to re-run your analysis, this time controlling for population density?
— Amateur #2 · Sep 16, 02:09 AM · #
rortybomb:
Thanks.
I have purposely not attempted to do that for the reasons that I described, and that you reference. I am highly skeptical of being able to control for such variables, and due to OVB, improperly controlling for a variable, or even controlling for only a subset of actual variables, will tend to skew parameter estimates even worse than failure to control. That, and I didn’t need to do it for the purposes of the analysis, and only had finite time:)
Yeah, I was expecting a lot of heteroskedascity in that chart, but the residuals plot is almost totally pure – it also blows me away how smart Gauss was.
GIS_expert:
Please see the comment that immediately follows yours.
Steve:
I’ve read a lot of your stuff on AFF, and find it fascinating. Your theory about the underlying causal driver of this effect may be correct, but I don’t have a data-driven opinion on causality here.
There was a decent amount of variation in Bush’s performance in 2004 vs. 2000 at the county level.
amatuer #2:
Please see the first part of my reply to rortybomb.
— Jim Manzi · Sep 16, 03:08 AM · #
Is the issue really inequality, per se?
If a county is all-poor or almost all-poor, do people vote Republican? If a district is all-rich or nearly all-rich, does it vote Republican? If so, Frum’s point would be confirmed.
But my sense is that inequality is not the real issue. Rather, it’s that in America, the very rich and the very poor have BOTH come to identify with the Democratic Party, albeit for very different reasons.
The very poor tend to vote for the party that offers them the most financial benefits. The very rich can afford to ignore basic pocketbook issues and vote on their preferred social issues. A single black mother in Harlem votes a straight Democratic ticket because the Democrats romise to give her more money, while a rich, white Park Avenue matron votes a straight Democratic ticket because she supports gay rights, abortion, and government funding of the arts, and has no personal reasons to worry about street crime or affirmative action (her home has plenty of security, and her kids will get into the Ivy League as legacies).
Now, the Park Avenue matron may vote Republican now and then, IF she has reason to fear massive tax increases. But since the Clinton years, it’s harder to paint the Democrats as the party of 70% tax rates.
The income inequality of Manhattan, by itself, has little to do with how people vote. But if you’re at either end of the income spectrum, you have good reasons to vote Democratic.
— astorian · Sep 16, 02:44 PM · #
Well, I’m happy to stand outside this post and admire your collective mental abilities from farther down the IQ scale.
Would anyone like to discuss which gluten-free cookie mixes taste best for least cost?
— Joules · Sep 16, 04:32 PM · #
By the way, what everybody is really interested in are the differences in voting behavior among white people, which are big, running from Bush getting only about 40% of the white vote in 2004 in Vermont, Massachusetts and Rhode Island to about 75% in Utah. Since whites cast just under 80% of the vote in 2004, that’s pretty much the whole ballgame.
Minority voting patterns are more predictable and less important due to smaller numbers.
A little study I did at the state level in 2000 showed that the type of diversity had an impact on white voting behavior. A lot of blacks in a state was associated with whites toward voting Republican, while a lot of Asians was associated with whites voting Democratic, with Hispanics in the middle.
— Steve Sailer · Sep 16, 07:33 PM · #
Jim,
Do you have a link for the county-level data? I don’t see a source in the Wikipedia article you link to.
— Inductivist · Sep 16, 07:39 PM · #
Re: Those are typically in inland metropolitan areas where suburbs can expand 360 degrees — i.e., suburban expansion isn’t blocked by ocean or Great Lake.
This kinda works for Chicago, but does not apply to the other Great Lakes cities: housing in the Detroit, Milwaukee, Cleveland, Buffalo and Rochester metro areas is quite affordable. Yet the areas are still noted for heavy Democratic voting patterns. You may wish to consider that housing costs are not the only variable is family formation: the availability of jobs which pay a family-sustaining wage (in short supply in the Great Lakes region) also matters.
— Jonfraz · Sep 17, 12:46 AM · #
I think…this can be a bellcurve map too. Its the tails vs the middle chunk.
The l33ts understand that the left tail requires help.
The middle, the “yeoman farmers”, firmly believe that they can bootstrap themselves with their religio-cultural values, and have nothing but envy for the uppertail, and disdain for the lower tail.
— matoko_chan · Sep 17, 01:37 PM · #
I wonder if this means wealth in Ohio is becoming less evenly distributed. Or was 2006 just a result of Republican ineptitude and corruption. . .?
— Julana · Sep 17, 08:29 PM · #
To Joules,
I’m interested in the answer to the gluten-free value cookie question.
When I read Andrew Sullivan, I laugh inside; I can vote.
When I read here, I laugh; I can comment.
And I find the funniest line in the SNL Palin sketch funny to be: “Anyone can run!”
It’s a great country.
— Julana · Sep 19, 01:35 AM · #
I don’t mean to be facetious. Thanks to the bloggers who post here. Great job.
— Julana · Sep 19, 11:22 AM · #
Hi Jim,
Great analysis. As it happen, me and a friend, are currently running an experiment about this phenomenon. I would very much like to talk to you about your analysis. Do not hesitate to drop me a mail.
Cheers,
Lionel
— Lionel · Sep 24, 05:08 PM · #
Dear Jim,
Would you be willing to make your hybrid inequality index publicly available? It would save others the trouble of doing all the joins.
— Dan Goldstein · Sep 25, 09:27 PM · #