TY - JOUR
T1 - A LATENT CLASS MODELING APPROACH FOR GENERATING SYNTHETIC DATA AND MAKING POSTERIOR INFERENCES FROM DIFFERENTIALLY PRIVATE COUNTS
AU - Nixon, Michelle Pistner
AU - Barrientos, Andrés F.
AU - Reiter, Jerome P.
AU - Slavkovic, Aleksandra
N1 - Funding Information:
4The NLTCS (National Long Term Care Study) is sponsored by the National Institute of Aging and was conducted by the Duke University Center for Demographic Studies under Grant No. U01-AG007198.
Funding Information:
This work was supported by the Pennsylvania State University, Duke University, and the National Science Foundation under awards 1443014, 1144860, 1534433, and 1131897.
Publisher Copyright:
© M. Nixon, A. Barrientos, J. Reiter, and A. Slavković.
PY - 2022/7/29
Y1 - 2022/7/29
N2 - Several algorithms exist for creating differentially private counts from contingency tables, such as two-way or three-way marginal counts. The resulting noisy counts generally do not correspond to a coherent contingency table, so that some post-processing step is needed if one wants the released counts to correspond to a coherent contingency table. We present a latent class modeling approach for post-processing differentially private marginal counts that can be used (i) to create differentially private synthetic data from the set of marginal counts, and (ii) to enable posterior inferences about the confidential counts. We illustrate the approach using a subset of the 2016 American Community Survey Public Use Microdata Sets and the 2004 National Long Term Care Survey.
AB - Several algorithms exist for creating differentially private counts from contingency tables, such as two-way or three-way marginal counts. The resulting noisy counts generally do not correspond to a coherent contingency table, so that some post-processing step is needed if one wants the released counts to correspond to a coherent contingency table. We present a latent class modeling approach for post-processing differentially private marginal counts that can be used (i) to create differentially private synthetic data from the set of marginal counts, and (ii) to enable posterior inferences about the confidential counts. We illustrate the approach using a subset of the 2016 American Community Survey Public Use Microdata Sets and the 2004 National Long Term Care Survey.
UR - http://www.scopus.com/inward/record.url?scp=85135151272&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85135151272&partnerID=8YFLogxK
U2 - 10.29012/jpc.768
DO - 10.29012/jpc.768
M3 - Article
AN - SCOPUS:85135151272
SN - 2575-8527
VL - 12
JO - Journal of Privacy and Confidentiality
JF - Journal of Privacy and Confidentiality
IS - 1
ER -