TY - JOUR
T1 - The use of differential privacy for census data and its impact on redistricting
T2 - The case of the 2020 U.S. Census
AU - Kenny, Christopher T.
AU - Kuriwaki, Shiro
AU - McCartan, Cory
AU - Rosenman, Evan T.R.
AU - Simko, Tyler
AU - Imai, Kosuke
N1 - Publisher Copyright:
Copyright © 2021 The Authors
PY - 2021/10
Y1 - 2021/10
N2 - Census statistics play a key role in public policy decisions and social science research. However, given the risk of revealing individual information, many statistical agencies are considering disclosure control methods based on differential privacy, which add noise to tabulated data. Unlike other applications of differential privacy, however, census statistics must be postprocessed after noise injection to be usable. We study the impact of the U.S. Census Bureau’s latest disclosure avoidance system (DAS) on a major application of census statistics, the redrawing of electoral districts. We find that the DAS systematically undercounts the population in mixed-race and mixed-partisan precincts, yielding unpredictable racial and partisan biases. While the DAS leads to a likely violation of the “One Person, One Vote”standard as currently interpreted, it does not prevent accurate predictions of an individual’s race and ethnicity. Our findings underscore the difficulty of balancing accuracy and respondent privacy in the Census.
AB - Census statistics play a key role in public policy decisions and social science research. However, given the risk of revealing individual information, many statistical agencies are considering disclosure control methods based on differential privacy, which add noise to tabulated data. Unlike other applications of differential privacy, however, census statistics must be postprocessed after noise injection to be usable. We study the impact of the U.S. Census Bureau’s latest disclosure avoidance system (DAS) on a major application of census statistics, the redrawing of electoral districts. We find that the DAS systematically undercounts the population in mixed-race and mixed-partisan precincts, yielding unpredictable racial and partisan biases. While the DAS leads to a likely violation of the “One Person, One Vote”standard as currently interpreted, it does not prevent accurate predictions of an individual’s race and ethnicity. Our findings underscore the difficulty of balancing accuracy and respondent privacy in the Census.
UR - https://www.scopus.com/pages/publications/85116904019
UR - https://www.scopus.com/inward/citedby.url?scp=85116904019&partnerID=8YFLogxK
U2 - 10.1126/sciadv.abk3283
DO - 10.1126/sciadv.abk3283
M3 - Article
C2 - 34613778
AN - SCOPUS:85116904019
SN - 2375-2548
VL - 7
JO - Science Advances
JF - Science Advances
IS - 41
M1 - eabk3283
ER -