HOME
May 17, 2022 (updated: May 21, 2022)

New Maps: NY “Special-Master” Edition

Updates 5/21/2022: This post has been updated to reflect the final map from the NY state court.

NY has had a long, strange trip to its current proposed congressional district map. In 2014, the New York state legislature passed an anti-gerrymandering law. In 2022, under Dem control, they created new maps based on the new census and, depending on who you ask, skirted the edges of that law or ignored it entirely. A court agreed that the maps ran afoul of the law and handed map-drawing responsibilities to a “special master,” a court appointed district-map-drawing expert.

Details follow. For a summary of our most interesting findings about NY’s new districts, click here.

Last Monday, a draft map was released by the special master, and a slightly different map was finalized by the court late Friday night. Unsurprisingly, this map is less favorable to Dems than their hand-tuned one. There is a lot of disagreement, though, over how to look at this map (NB: these arguments are about the first map released by the special master, not the court’s final map). Quite a bit of that is about frame of reference. The map is defintely worse for Ds than what the legislature tried to do, but seems better for Dems than the current map (the one used from 2012 through the last election in 2020), though also more competitive than either. And it’s that last bit that leaves room for interpretation. In a neutral or good year for Dems, this map is better than what they had before. But in a wave year for Rs, this map could be worse. And, of course, any state or district can defy the national trend.

We haven’t had much time with the map but here’s a quick look!

  1. Dem-lean by district in NY: our demographic model vs. historical data
  2. Why Are These Models So Different?
  3. BlueRipple on other States
  4. Coda #1: Demographics of the new NY districts
  5. Coda #2: Brief intro to our methods (for non-experts)

1. Dem-lean by district in NY: our demographic model vs. historical data

Our “demographic” model forecasts the potential Democratic lean of each new district in NY based on attributes like race, education, and population density. In the graph and table below, we compare our predictions to a “historical” model (from the excellent Dave’s Redistricting (DR) web-site) built up from precinct-level results in prior elections1. (See methods at the end of this post for more details.) The axes show the projected 2-party Dem vote share with each model. The diagonal line represents where districts would fall on this scatter-plot if the two models agreed precisely. In districts to the left of the line, our demographic model thinks the D vote share is higher than historical results, and to the right of the line, we think it’s lower than the historical model predicts2.

We’re not attempting to predict outcomes, or compete with the historical model as an early estimate. Instead, our model looks only at demographic makeup and population density of the district to infer a 2-party share. Large differences between the historical model and our demographic one may point to pickup opportunities or vulnerabilties.

NB: For this and all scatter charts to follow, you can pan & zoom by dragging with the mouse or moving the scroll wheel. To reset the chart, hold shift and click with the mouse.

We generally focus our attention on districts that fall in the 45-55% range of Dem share in our demographic model and 47-53% in the historical model. That’s because we think a 3-point gap is one that either party could potentially close with some focused energy, resources, and strategic thinking. Our demographic model carries some additional uncertainties, so we expand the range a bit there. Our methodology for using our model and the historical data to classify districts is explained in this post.

With that in mind, here are a few observations on the new NY districts:

  • First, let’s dispense with the obvious ones. Several NY districts are clearly far outside the competitive range. Both the demographic model and the historical model agree that 15 districts (NY-4, NY-5, NY-6, NY-7, NY-8, NY-9, NY-10, NY-12, NY-13, NY-14, NY-15, NY-16, NY-17, NY-25, and NY-26) are safe D. Both models also agree that 3 districts (NY-21, NY-23, and NY-24) are far out of reach for Dems.

  • We are much more optimistic about NY-2, NY-3 and NY-11 than the historical model. We think they are all safe D. But the historical model sees them as tossups, with NY-3 as D-leaning, NY-2 as R-leaning and NY-11 as safe-R.

  • Our model sees a strong D-lean in NY-1 where the historical model sees lean-R.

  • We are concerned about vulnerability in NY-19 which we see as a safe R district where the historical model sees a D-leaning toss-up. Less urgently, in NY-20, our model sees a D-leaning toss-up where the historical model sees a Safe D district.

  • The remaining districts (NY-18 and NY-22) are tossups in both models.

We see this as an 18D-4R-4C (“C” for “competitive”) map. The historical analysis, based on the last 3 elections, sees something more like 16D-4R-6C.

The table below shows this in a different form:

Calculated Dem Vote Share, NY 2022: Demographic Model vs. Historical Model (DR)
StateDistrictDemographic Model (Blue Ripple)Historical Model (Dave's Redistricting)BR Stance
NY57983Safe D (No near-term D risk)
NY157988Safe D (No near-term D risk)
NY137892Safe D (No near-term D risk)
NY97878Safe D (No near-term D risk)
NY87879Safe D (No near-term D risk)
NY147581Safe D (No near-term D risk)
NY167272Safe D (No near-term D risk)
NY67268Safe D (No near-term D risk)
NY77184Safe D (No near-term D risk)
NY107187Safe D (No near-term D risk)
NY127086Safe D (No near-term D risk)
NY46857Safe D (No near-term D risk)
NY116646Flippable (Strongly D-leaning)
NY36553Toss-up (Highly Winnable by D)
NY26148Toss-up (Highly Winnable by D)
NY175955Safe D (No near-term D risk)
NY265960Safe D (No near-term D risk)
NY255859Safe D (No near-term D risk)
NY15648Toss-up (Highly Winnable by D)
NY205458Becoming At-Risk (More Balanced than Advertised)
NY185452Toss-up (Down to the Wire)
NY225153Toss-up (Down to the Wire)
NY234439Safe R (No near-term D hope)
NY194351Toss-up (Highly vulnerable for D)
NY244140Safe R (No near-term D hope)
NY214143Safe R (No near-term D hope)

2. Why Are These Models So Different?

Our findings in NY-11, a swingy district which includes Staten Island, provide a good opportunity to discuss why BR’s demographic model and the historical model may differ. In the old map, this district included Staten Island and some of Sunset Park in Brooklyn. The struck-down Dem map added very D-leaning Park Slope into the district, making it a safe-D seat. The new map removes Park Slope and slightly changes the boundaries in southwestern Brooklyn. Under the old map, Democrat Max Rose won this seat in the D-wave of 2018, and promptly lost it to Republican Nicole Malliotakis in 2020.

NY-11 looks like a safe-R district given the voting patterns in the precincts within it, but our demographic model suggests it’s D+16! What might this mean? One way to answer this question is to consider the difference between the two models. Our demographic model asks: if the voting-age citizens of this district turned out and voted like similar people in other parts of the country,  what would we expect the outcome of this election to be? Whereas the historical model asks how we'd expect the election to turn out if the voters in this district turn out and vote as they have in previous elections. This points to a few possible reasons why a historically tossup district like NY-11 might look so strongly D in our model–including, but not limited to, the following:

  • Our model may be wrong about how we define "similar" voters. We've incorporated factors like education and race, but maybe we've missed key things that make voters in NY-11 different from superficially "similar" voters in other districts nationwide.

  • Location-specific factors may suppress Dem voting. E.g., perhaps the Democratic party or local organizations are particularly poorly-organized in NY-11, or voter suppression plays a large role and that is reflected in the historical model using those voters. Location specific history also matters: Staten Island is, historically, a very R leaning part of New York City and our model does not account for that.

  • Democrats in NY-11 may, in fact, have underperformed relative to their potential. Or there may have been demographic shifts in the district since the last election which favor Democrats. NY-11’s Citizen-Voting-Age-Population (CVAP) is about 45% non-White (about an even mix of Hispanic and Asian CVAP, as well as a smaller but significant Black CVAP) and about 35% of that CVAP graduted from college. Such districts are usually good for Dem candidates.

We don't know which (if any) of these explanations is correct. But our model suggests that NY-11 might be an easier pickup for Dems than the history indicates.

3. BlueRipple on Other States

  • Here are our takes on the new maps in Arizona, Georgia, Michigan, North Carolina, Pennsylvania, and Texas.

  • We’re going to continue to refine and improve our demographic model–we’ll update this post and others as we do so. Feel free to contact us if you want more details on the mechanics, or if you’d like to propose changes or improvements.

  • As maps get solidified, we’ll set up ActBlue donation links for candidates (after the primaries) to make it easy for you to donate.

If you want to stay up-to-date, please sign up for our email updates! We’re also on Twitter, Facebook, and Github.

4. Coda #1: Demographics of new vs. old NY districts

One thing we haven’t seen discussed very much is how redistricting in NY has changed the demographics in each district. As a way of putting the demographic model results in context, let’s look at the underlying population two different ways:

  • The first chart below shows each of NY’s proposed 2022 districts, with the population broken down by race/ethnicity (Black, Hispanic, Asian, White-non-Hispanic and other) and education (college graduate and non-college graduate). Each bar also has a dot representing the (logarithmic) population density3 of the district. The scale for that dot is on the right-side axis of the chart. For reference, a log density of 5 (NY-21) represents about 150 people per square mile and a log density of 11.5 (NY-12 and NY-13) represents about 100,000 people per square mile. We’ve ordered the districts by D-share based on our demographic model, which is helpful for understanding how the model responds to demographics and density.

  • In the second chart, we look at these demographics a different way, placing each NY district according to its proportion of college graduates and non-white citizens of voting age. We also indicate (logarithmic) population density via the size of the circle and modeled D-edge (D-share minus 50%) via color. This makes it easier to see that the model predicts larger D vote-share as the district becomes more educated, more non-white and more dense.

It’s hard to see anything specific from these charts, though we are continuing to examine them as we try to understand what might be happening in each specific district.

5. Coda #2: Brief intro to our methods (for non-experts)

This part of the post contains a general summary of the math behind what we’re doing here intended for non-experts. If you want even more technical details, check out the links at the end of this section, visit our Github page, or contact us directly.

Our model is demographic. We use turnout data from the 2020 CPS voter supplement (a self-reported survey); voting and turnout data from the 2020 CES (a validated survey); and election result data from the 2020 presidential, senate and house elections. The survey data from the CPS and CES is broken down by several demographic categories, including sex, education and race/ethnicity.

The election results are trickier to use in the model since we don’t have demographic information paired with with turnout or vote choice. What we do know is the overall demographics of the state or house district. So we use the election-data to assign a likelihood to the post-stratification of our parameters across the demographics of the relevant region (from the micro-data ACS).

Then we look at the demographics of a particular house or state-legislative district (using tract-level census data from the ACS), breaking it down into the same categories and then apply our model of turnout and voter preference to estimate the 2-party vote share we expect for a Democratic candidate.

This is in contrast to what we call the historical model: a standard way to predict “partisan lean” for any district, old or new: break it into precincts with known voting history (usually a combination of recent presidential, senate and governors races) and then aggregate those results to estimate expected results in the district.

The historical model is likely to be a pretty accurate “predictor” if you think the same people will vote the same way in subsequent elections, regardless of where the district lines lie. So why did we build a demographic model? Three reasons:

  1. We’re interested in places where the history may be misleading, either because of the specific story in a district or because changing politics or demographics may have altered the balance of likely voters.4

  2. Our demographic analysis is potentially more useful when the districts are new, since voting history may be less “sticky” there. For example, if I’m a Dem-leaning voter in a strong-D district, I might not have bothered voting much in the past because I figured my vote didn’t matter. But if I now live in a district that’s more competitive in the new map, I might be much more likely to turn out.

  3. We’re not as interested in predicting what will happen in each district, but what plausibly could happen in each district if Dems applied resources in the right way, or fail to when the Republicans do. The historical model is backward-looking, whereas our demographic model is forward-looking making them complementary when it comes to strategic thinking.

Two final points. First, when it comes to potential Dem share in each district, we’re continuing to improve and refine our demographic model. The Blue Ripple web-site contains more details on how it works and some prior results of applying a similar model to state legislative districts, something we will also do more of in the near future. Second, for the historical model comparator, we use data from the excellent “Dave’s Redistricting”, which is also the source of our maps for the new districts.

Want to read more from Blue Ripple? Visit our website, sign up for email updates, and follow us on Twitter and FaceBook. Folks interested in our data and modeling efforts should also check out our Github page.


  1. One important note about the numbers. Dave’s Redistricting gives estimates of Democratic candidate votes, Republican candidate votes and votes for other candidates. We’ve taken those numbers and computed 2-party vote share for the Democratic candidate, that is, D Votes/(D Votes + R Votes). That makes it comparable with the Demographic model which also produces 2-party vote share.↩︎

  2. We’ve also done this modeling for the old districts and compared that result to the actual 2020 election results. See here.↩︎

  3. We use logarithms here because density varies tremendously over districts, from tens to hundreds of thousands of people per square mile. We use population-weighting because the resulting average more closely expresses the density of where people actually live. For example, consider a district made up of a high-density city where 90% of the population live and then large but low-density exurbs where the other 10% live. Most people in that district live at high density and we want our density to reflect that even though the unweighted average density (people/district size) might be smaller.↩︎

  4. We’re also interested in voter empowerment strategies. In particular, questions about where and among whom, extra turnout might make a difference. The historical model is no help here since it does not attempt to figure out who is voting or who they are voting for in a demographically specific way.↩︎