TY - JOUR
T1 - Differentially private inference for binomial data
AU - Awan, Jordan
AU - Slavković, Aleksandra
N1 - Funding Information:
We thank Vishesh Karwa and Matthew Reimherr for helpful discussions and feedback on previous drafts; Anton Xue for helping to develop some of the R code; and the reviewers for their helpful comments and suggestions, which improved the completeness and accessibility of this paper. This work is supported in part by NSF Award No. SES-1534433 to The Pennsylvania State University. We thank the Simons Institute for the Theory of Computing and the Center for Research on Computation and Society at Harvard Univerisity for their hospitality during part of this work.
Publisher Copyright:
© J. Awan and A. Slavković.
PY - 2020
Y1 - 2020
N2 - We derive uniformly most powerful (UMP) tests for simple and one-sided hypotheses for a population proportion within the framework of differential privacy (DP), optimizing finite sample performance. We show that in general, DP hypothesis tests can be written in terms of linear constraints, and for exchangeable data can always be expressed as a function of the empirical distribution. Using this structure, we prove a ‘Neyman-Pearson Lemma’ for binomial data under DP, where the DP-UMP only depends on the sample sum. Our tests can also be stated as a post-processing of a DP summary statistic, whose distribution we coin “Truncated-Uniform-Laplace” (Tulap), a generalization of the Staircase and discrete Laplace distributions. We show that by post-processing the Tulap statistic, we are able to obtain exact p-values corresponding to the DP-UMP, uniformly most accurate (UMA) one-sided confidence intervals, optimal confidence distributions, uniformly most powerful unbiased (UMPU) two-sided tests, and uniformly most accurate unbiased (UMAU) two-sided confidence intervals. As each of these quantities are a post-processing of the same summary statistic, there is no increased cost to privacy by including these additional results, allowing for a complete statistical analysis at a fixed privacy cost. We also show that our results can be applied to distribution-free hypothesis tests for continuous data. Our simulation results demonstrate that all our tests have exact type I error, and are more powerful than current techniques.
AB - We derive uniformly most powerful (UMP) tests for simple and one-sided hypotheses for a population proportion within the framework of differential privacy (DP), optimizing finite sample performance. We show that in general, DP hypothesis tests can be written in terms of linear constraints, and for exchangeable data can always be expressed as a function of the empirical distribution. Using this structure, we prove a ‘Neyman-Pearson Lemma’ for binomial data under DP, where the DP-UMP only depends on the sample sum. Our tests can also be stated as a post-processing of a DP summary statistic, whose distribution we coin “Truncated-Uniform-Laplace” (Tulap), a generalization of the Staircase and discrete Laplace distributions. We show that by post-processing the Tulap statistic, we are able to obtain exact p-values corresponding to the DP-UMP, uniformly most accurate (UMA) one-sided confidence intervals, optimal confidence distributions, uniformly most powerful unbiased (UMPU) two-sided tests, and uniformly most accurate unbiased (UMAU) two-sided confidence intervals. As each of these quantities are a post-processing of the same summary statistic, there is no increased cost to privacy by including these additional results, allowing for a complete statistical analysis at a fixed privacy cost. We also show that our results can be applied to distribution-free hypothesis tests for continuous data. Our simulation results demonstrate that all our tests have exact type I error, and are more powerful than current techniques.
UR - http://www.scopus.com/inward/record.url?scp=85088243714&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088243714&partnerID=8YFLogxK
U2 - 10.29012/jpc.725
DO - 10.29012/jpc.725
M3 - Article
AN - SCOPUS:85088243714
SN - 2575-8527
VL - 10
SP - 1
EP - 40
JO - Journal of Privacy and Confidentiality
JF - Journal of Privacy and Confidentiality
IS - 1
ER -