{"id":264,"date":"2012-11-14T16:26:54","date_gmt":"2012-11-14T16:26:54","guid":{"rendered":"http:\/\/blog.fellstat.com\/?p=264"},"modified":"2012-11-14T16:26:54","modified_gmt":"2012-11-14T16:26:54","slug":"how-the-democrats-may-have-won-the-house-but-lost-the-seats","status":"publish","type":"post","link":"https:\/\/blog.fellstat.com\/?p=264","title":{"rendered":"How the Democrats may have won the House, but lost the seats"},"content":{"rendered":"<p>&nbsp;<\/p>\n<p>The 2012 election is over and in the books. A few very close races remain to be officially decided, but for the most part everything has settled down over the last week. By all\u00a0accounts\u00a0it was a very good night for the Democrats, with wins in the presidency, senate and state houses. They also performed better than expected in the House of Representatives. If we take the results as they stand now, the Democrats\u00a0will have 202 seats v.s. the Republican&#8217;s 233 seats, a pick-up of 9 relative to the 2010 election. This got me thinking about how the Republicans could have maintained 54% of the house seats during a Democratic wave election.<\/p>\n<p>Since I&#8217;m a statistician, the obvious answer was to go get the data and find out. Over the last few days, I was able to obtain unofficial vote counts for each congressional race except 3. The results from two unopposed Democrats in Massachusetts&#8217; 1st and 2nd district, and one unopposed Republican from Kansas&#8217; 1st district will not be available until the final canvas is done in December. Not including these districts\u00a055,709,796 votes were cast for Democrats, v.s.\u00a055,805,487 for Republicans, a difference of less than 100,000 votes.<\/p>\n<p>Both of the candidates from Massachusetts also ran unopposed in 2008, where they garnered 234,369 and 227,619 votes respectively, which seems like a good estimate for what we can expect this year. In 2008 Kansas&#8217; 1st district had a total of\u00a0262,027 votes cast. We will assign all of them to the Republican candidate as an (over) estimate, as the old 2002 district lines are similar to their 2012 placements. Using these numbers we get\u00a0estimated\u00a0vote counts of\u00a056,171,784 for the Democrats and\u00a056,067,514 for the Republicans, leading to a 100,000 vote advantage for the Democrats. Either way, this election was a real\u00a0squeaker\u00a0in terms of popular vote, which in percentage terms is\u00a050.046% to 49.954%. Until the full official results are released, I would classify this as &#8220;too close to call,&#8221; but find a Democrat popular vote victory likely.<\/p>\n<p><a href=\"http:\/\/fivethirtyeight.blogs.nytimes.com\/2012\/11\/12\/turnout-steady-in-swing-states-and-down-in-others-but-many-votes-remain-uncounted\/\">Nate Silver<\/a> recently looked at the differences in the number of votes cast in 2008 v.s. those already reported in 2012, and found that California has reported 3.4 million fewer votes this election than 2008, suggesting that millions of mail-in ballots remain waiting to be certified. Somewhat similarly, New York reported 1.5 million fewer votes, though this may be due (at least in part) to the effects of hurricane Sandy. These were the only two states with differences &gt; 1 million, and they both lean heavily Democratic, which may well push the popular vote totals for the Democrats into &#8220;clear winner&#8221; territory.<\/p>\n<p>In a certain sense, the winner of the popular vote is\u00a0irrelevant, as the only thing that matters in terms of power is the number of seats\u00a0held\u00a0by a party. But if the Democrats win, then they can reasonably claim that they (even as the minority party) are the ones with the mandate to enact their agenda.<\/p>\n<p>Even though the popular vote is razor close in the current unofficial results, the distribution of seats is certainly not. We shouldn&#8217;t expect there to be a perfect\u00a0correspondence\u00a0between the popular vote and seats, because who wins the seats depends on how the population is\u00a0divided\u00a0up into congressional districts. As a simplified example consider three districts, each with 100,000 voters. District 1 is heavily Democratic, with 100% of votes going to the democratic candidate, while\u00a0districts\u00a02 and 3 are swing districts where 51% of the vote goes to the Republicans. In this simplified example, the Democrats would win nearly 2\/3 of the popular vote, but only 1\/3 of the seats.<\/p>\n<p>So, if we expect there to be a discrepancy between popular vote and seat totals, the question then becomes: Is this election unusual? To answer this, we can look at the official\u00a0<a href=\"http:\/\/clerk.house.gov\/member_info\/electionInfo\/index.aspx\">historical record<\/a>\u00a0going back to 1942 which gives us the following:<\/p>\n<figure id=\"attachment_266\" aria-describedby=\"caption-attachment-266\" style=\"width: 550px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/vote_to_seats.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-266\" title=\"Relationship between popular vote, and share of congressional seats\" src=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/vote_to_seats-1024x853.png\" alt=\"\" width=\"550\" height=\"458\" \/><\/a><figcaption id=\"caption-attachment-266\" class=\"wp-caption-text\">Relationship between popular vote, and share of congressional seats (1942-2012)<\/figcaption><\/figure>\n<p>In almost every election in the last 70 years, the party with the popular vote victory also won the majority of seats. The only exception to this was 1996, where the\u00a0Democrats\u00a0won the popular vote, but failed to attain a majority in congress. If 2012 turns out to be a popular vote victory for the Democrats, then their seat deficit would be unheard of in living memory. On the other hand, there was a very close election in 1950, where the Republicans lost by a margin of just 0.04%, but ended up with only 199 seats to the Democrats 234.<\/p>\n<h1 style=\"text-align: center;\">Redistricting, Gerrymandering, oh my!<\/h1>\n<p>Whether or not the official vote tally goes to the Democrats, the 2012 House of Representatives electoral landscape is unusual. The seat totals clearly favor the Republicans, but why? Some have noted that the congressional district lines were redrawn following the 2010 Census, a process that in many states is\u00a0controlled\u00a0by the state legislature, and that many of these lines seem to be constructed to maximize the political advantage of one party or the other. I was initially\u00a0skeptical\u00a0of these claims. After all, the two years most comparable to 2012 are 1996 and 1950, neither of which are particularly near a redistricting period. As is often the case, an examination of the data revealed evidence that was counter to my expectations.<\/p>\n<p>First, let us explore how one would draw lines that maximize party advantage. Remember in our simple three district example, the advantage was gained by bunching all of the\u00a0Democratic\u00a0voters into one district, and maintaining slim advantages in the other two. So, the rule is to spread out your own vote into many districts, and\u00a0consolidate\u00a0your opponents voters into just a few. This can be a difficult geometric &#8220;problem&#8221; to solve, but an easy way to think about one possible route toward a solution is to focus on the swing districts. If a district is very close, you will want to alter its location so that it has just enough more Republican votes to make it a safe bet for your party. Ideally you will want to take these Republican votes from a safe Democratic district, making it even more Democratic. What you can then expect when you look at the distribution of districts is a clump of very safe Democratic districts, a clump of safe, but not overwhelmingly safe, Republican districts, and very few toss-up districts. This is what we call a <a href=\"http:\/\/en.wikipedia.org\/wiki\/Bimodal_distribution\">bimodal distribution<\/a>, and\u00a0the process of redistricting with a focus on political gain is known as <a href=\"http:\/\/en.wikipedia.org\/wiki\/Gerrymandering\">gerrymandering<\/a>. There is nothing particularly Republican about it. Both political parties engage in gerrymandering, the difference being whether or not they\u00a0<a href=\"http:\/\/redistricting.lls.edu\/who-partyfed.php\">control the redistricting process<\/a>, and to what degree are they willing to contort the shape of their districts.<\/p>\n<table border=\"0\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"top\" width=\"221\"><strong>Authority to draw lines<\/strong><\/td>\n<td valign=\"top\" width=\"221\"><strong># of Congressional districts<\/strong><\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Democratic party<\/td>\n<td valign=\"top\" width=\"221\">44<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Republican party<\/td>\n<td valign=\"top\" width=\"221\">174<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Split between parties<\/td>\n<td valign=\"top\" width=\"221\">83<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Independent Commission<\/td>\n<td valign=\"top\" width=\"221\">88<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Only one district in the state<\/td>\n<td valign=\"top\" width=\"221\">7<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Non-partisan legislature<\/td>\n<td valign=\"top\" width=\"221\">3<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"221\">Temporary court drawn<\/td>\n<td valign=\"top\" width=\"221\">36<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The Republican party was in control of the redistricting of 174 districts, v.s. just 44 for the Democrats.\u00a0Just to give a\u00a0specific\u00a0example, Ohio had a Republican controlled redistricting process, which though wildly\u00a0successful\u00a0for Republicans, lead to\u00a0torturously\u00a0shaped districts. Republicans won 52.5% of the vote in Ohio, but obtained a staggering 75% of the seats.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-thumbnail wp-image-275\" title=\"ohio_statewide\" src=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/ohio_statewide-150x150.png\" alt=\"\" width=\"150\" height=\"150\" \/><\/p>\n<p>But that is just one state, what about the rest of the districts? If we plot the proportion of people in each district that go to the Democrats, broken down by who was in control of\u00a0redistricting\u00a0we see clear evidence of clumping into safe, but not overwhelmingly safe\u00a0territory\u00a0for the redistricting party.<\/p>\n<figure id=\"attachment_276\" aria-describedby=\"caption-attachment-276\" style=\"width: 550px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/vote_share.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-276\" title=\"Share of vote by redistricting party\" src=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/vote_share-1024x713.png\" alt=\"\" width=\"550\" height=\"382\" \/><\/a><figcaption id=\"caption-attachment-276\" class=\"wp-caption-text\">Share of vote by who controlled redistricting<\/figcaption><\/figure>\n<p>The way to read this plot is that each point represents a district, with its position being the proportion of people in the district who voted for the Democratic candidate. The lines represent a smoothed estimate of the distribution, with wider parts indicating increased density. Among the districts where Republicans had control, we see a characteristic bimodal distribution, with many districts chosen to be relatively safe Republican strongholds, with a smaller group of heavily Democratic districts. The same trend may be present in the Democratically controlled districts, but there are too few of them to see the trend clearly. Alternatively, in districts where the\u00a0decision making\u00a0is split between parties, or delegated to an\u00a0independent\u00a0group, we see a more natural distribution of vote shares, with a good number of toss-up districts.<\/p>\n<h3 style=\"text-align: center;\">But can we prove that it\u00a0swung\u00a0the election?<\/h3>\n<p>To parse out the effect of redistricting on the election results in a non-descriptive sense, it is\u00a0necessary\u00a0to construct (a simple idealized) model of how\u00a0redistricting\u00a0affects the probability of winning a district. To do this, we assume that the probability of winning a district within a state is related to the\u00a0partisanship\u00a0of the state (i.e. we expect a higher proportion of Republican seats in Alabama as compared to New York), which we will measure by the share of the vote that Romney won in the state. Second, control of redistricting (either Democratic, Republican, or\u00a0Independent\/Split) yields an increase (or decrease) in the odds of a win. We slap these two together into a standard statistical model known as a <a href=\"http:\/\/en.wikipedia.org\/wiki\/Logistic_regression\">logistic regression<\/a>, which\u00a0yields\u00a0the following:<\/p>\n<table border=\"1\" cellspacing=\"0\" cellpadding=\"0\">\n<tbody>\n<tr>\n<td valign=\"top\" width=\"136\"><strong>Variable<\/strong><\/td>\n<td valign=\"top\" width=\"86\"><strong>Df<\/strong><\/td>\n<td valign=\"top\" width=\"111\"><strong>Chi-squared<\/strong><\/td>\n<td valign=\"top\" width=\"111\"><strong>p-value<\/strong><\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"136\">Romney vote<\/td>\n<td valign=\"top\" width=\"86\">1<\/td>\n<td valign=\"top\" width=\"111\">31.4<\/td>\n<td valign=\"top\" width=\"111\">&lt;0.001<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"136\">Redistricting control<\/td>\n<td valign=\"top\" width=\"86\">2<\/td>\n<td valign=\"top\" width=\"111\">9.1<\/td>\n<td valign=\"top\" width=\"111\">\u00a0 0.010<\/td>\n<\/tr>\n<tr>\n<td valign=\"top\" width=\"136\">Residuals<\/td>\n<td valign=\"top\" width=\"86\">385<\/td>\n<td valign=\"top\" width=\"111\"><\/td>\n<td valign=\"top\" width=\"111\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>This table tells us two important things through its p-values. With p-values, the closer to 0, the more significant the relationship. First, that the\u00a0partisanship\u00a0of a district matters, because the proportion voting for Romney is very significantly related to the probability of congressional control. Secondly, control of redistricting matters. If the redistricting process were fair, we would see the magnitude of the trends that we see in this data only once every thousand years (10 years between redistricting \/ 0.010 ). This gives us a high degree of confidence that political concerns are the driving force in redistricting when politicians are put in charge of it (shocking, I know).<\/p>\n<p>This effect is\u00a0relatively\u00a0large. Consider an idealized swing state, where Romney won exactly 50% of the vote. Our model estimates the proportion of the congressional delegation who are Democrats as:<\/p>\n<figure id=\"attachment_278\" aria-describedby=\"caption-attachment-278\" style=\"width: 550px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/swing.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-278\" title=\"Swing state estimates\" src=\"http:\/\/blog.fellstat.com\/wp-content\/uploads\/2012\/11\/swing-1024x713.png\" alt=\"\" width=\"550\" height=\"382\" \/><\/a><figcaption id=\"caption-attachment-278\" class=\"wp-caption-text\">Estimates of the average proportion of a congressional delegation from a true swing state who are Democrats based on a logistic regression<\/figcaption><\/figure>\n<p>Here the model estimates that districts from an independent\/split source are roughly unbiased with an estimated proportion of seats of 48%. If the Republicans control the process then we would expect only 31% of the delegation to be Democrats. This is slightly above what we observed in Ohio, where only 25% of the delegation were Democrats,\u00a0despite\u00a0Romney&#8217;s percentage of the vote being nearly 50%. With the Democrats in control we see a smaller bias than the Republicans with an estimated 56% belonging to the Democratic party, but our confidence in this estimate (denoted by the red dotted lines) is small due to the small number of redistricting opportunities that they had.<\/p>\n<h3 style=\"text-align: center;\">A perfect world<\/h3>\n<p>What would have happened if the entire redistricting process was controlled\u00a0by independent or\u00a0bipartisan\u00a0agreement? Well, our simple model can tell us something about that. By counterfactually assuming that all redistricting was done\u00a0independently\u00a0or by split legislature, it estimates that the proportion of Democrats in the current congress would be, on average, 52%. If Democrats got to pick all the lines it would be 59%, and under complete Republican control it would be just 37%.<\/p>\n<h1 style=\"text-align: center;\">\u00a0Conclusions<\/h1>\n<p>So what\u00a0lessons\u00a0can we take away from this analysis? Control of the redistricting process is a powerful tool for those who would use it as a tool. The model estimates that who controls the process could\u00a0yield\u00a0 17% point swings (56%-39%) in a typical swing state. This is consistent both with the results from the general election as a whole, and Ohio in particular. While I would love to see the implementation of an <a href=\"http:\/\/en.wikipedia.org\/wiki\/Gerrymandering#Objective_rules_to_create_districts\">algorithmic solution<\/a> to the problem of district formation (if only to hear the pundits trying to say minimum isoperimetric quotient), it\u00a0appears\u00a0that\u00a0independent\u00a0commissions, or even\u00a0bipartisan\u00a0agreement within a legislative body are sufficient to have fair lines. As of now, 6 states decide their districts based on an independent\u00a0commission, and it is hard to think of an honest argument why every state should not adopt this model. Economics teaches us that\u00a0incentives\u00a0matter, and if you give\u00a0politicians\u00a0the incentive to bias districts in their favor, it is a safe bet that they will do so.<\/p>\n<p>&nbsp;<\/p>\n<p style=\"text-align: center;\">&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<\/p>\n<p><a href=\"http:\/\/www.fellstat.com\/\">Fellows Statistics<\/a>\u00a0provides experienced, professional statistical advice and analysis for the corporate and academic worlds.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>&nbsp; The 2012 election is over and in the books. A few very close races remain to be officially decided, but for the most part everything has settled down over the last week. By all\u00a0accounts\u00a0it was a very good night for the Democrats, with wins in the presidency, senate and state houses. They also performed [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-264","post","type-post","status-publish","format-standard","hentry","category-r"],"_links":{"self":[{"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=\/wp\/v2\/posts\/264","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=264"}],"version-history":[{"count":0,"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=\/wp\/v2\/posts\/264\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=264"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=264"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.fellstat.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=264"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}