TY - JOUR
T1 - Goodness-of-fit testing
T2 - The thresholding approach
AU - Kim, Min Hee
AU - Akritas, Michael G.
N1 - Funding Information:
This research was supported in part by NSF grant DMS-0805598.
PY - 2012/3
Y1 - 2012/3
N2 - The classical Pearson's chi-square test for goodness-of-fit has found extensive applications in areas such as contingency tables and, recently, multiple testing. Mann and Wald [(1942), 'On the Choice of the Number of Class Intervals in the Application of the Chi Square Test', The Annals of Mathematical Statistics, 13, 306-317] were the first to establish the power advantages of letting the number n bin of bins tend to infinity with n, and found n bin=n 2/5 to be the optimal rate. For a corresponding development in the area of contingency tables, see Holst [(1972), 'Asymptotic Normality and Efficiency for Certain Goodness-of-Fit Tests', Biometrika, 59, 137-145], Morris [(1975), 'Central Limit Theorems for Multinomial Sums', The Annals of Statistics, 3, 165-188], and Koehler and Larntz [(1980), 'An Empirical Investigation of Goodness-of-Fit Statistics for Sparse Multinomials', Journal of the American Statistical Association, 75, 336-344]. In this paper, we consider the use of thresholding methods to further improve on the power of Pearson's chi-square test. An alternative statistic, based on the cell averages, is also studied. The Fourier or wavelet transformation is used to ensure power enhancement in both high- and low-signal-to-noise ratio alternatives. Simulations suggest that application of order thresholding (Kim, M.H., and Akritas, M.G. (2010), 'Order Thresholding', The Annals of Statistics, 38, 2314-2350) achieves accurate type I error rates, and competitive power.
AB - The classical Pearson's chi-square test for goodness-of-fit has found extensive applications in areas such as contingency tables and, recently, multiple testing. Mann and Wald [(1942), 'On the Choice of the Number of Class Intervals in the Application of the Chi Square Test', The Annals of Mathematical Statistics, 13, 306-317] were the first to establish the power advantages of letting the number n bin of bins tend to infinity with n, and found n bin=n 2/5 to be the optimal rate. For a corresponding development in the area of contingency tables, see Holst [(1972), 'Asymptotic Normality and Efficiency for Certain Goodness-of-Fit Tests', Biometrika, 59, 137-145], Morris [(1975), 'Central Limit Theorems for Multinomial Sums', The Annals of Statistics, 3, 165-188], and Koehler and Larntz [(1980), 'An Empirical Investigation of Goodness-of-Fit Statistics for Sparse Multinomials', Journal of the American Statistical Association, 75, 336-344]. In this paper, we consider the use of thresholding methods to further improve on the power of Pearson's chi-square test. An alternative statistic, based on the cell averages, is also studied. The Fourier or wavelet transformation is used to ensure power enhancement in both high- and low-signal-to-noise ratio alternatives. Simulations suggest that application of order thresholding (Kim, M.H., and Akritas, M.G. (2010), 'Order Thresholding', The Annals of Statistics, 38, 2314-2350) achieves accurate type I error rates, and competitive power.
UR - http://www.scopus.com/inward/record.url?scp=84863127918&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863127918&partnerID=8YFLogxK
U2 - 10.1080/10485252.2011.606367
DO - 10.1080/10485252.2011.606367
M3 - Article
AN - SCOPUS:84863127918
SN - 1048-5252
VL - 24
SP - 119
EP - 138
JO - Journal of Nonparametric Statistics
JF - Journal of Nonparametric Statistics
IS - 1
ER -