TY - JOUR
T1 - A High-Dimensional Nonparametric Multivariate Test for Mean Vector
AU - Wang, Lan
AU - Peng, Bo
AU - Li, Runze
N1 - Funding Information:
Lan Wang (E-mail: [email protected]) is Associate Professor, and Bo Peng is graduate student (E-mail: [email protected]), School of Statistics, University of Minnesota, Minneapolis, MN 55455. Runze Li is the corresponding author and Distinguished Professor, Department of Statistics and the Methodology Center, the Pennsylvania State University, University Park, PA 16802-2111 (E-mail: [email protected]).Wang and Peng’s research is supported by an NSF grant DMS1308960. Li’s research is supported by NIDA, NIH grants P50 DA10075 and P50 DA036107. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIDA or the NIH. The authors thank the editor, the AE, and three referees for their constructive comments that help us significantly improve the article. The authors also thank Professor Tiefeng Jiang for helpful discussions.
Publisher Copyright:
© 2015, © American Statistical Association.
PY - 2015/10/2
Y1 - 2015/10/2
N2 - This work is concerned with testing the population mean vector of nonnormal high-dimensional multivariate data. Several tests for high-dimensional mean vector, based on modifying the classical Hotelling T2 test, have been proposed in the literature. Despite their usefulness, they tend to have unsatisfactory power performance for heavy-tailed multivariate data, which frequently arise in genomics and quantitative finance. This article proposes a novel high-dimensional nonparametric test for the population mean vector for a general class of multivariate distributions. With the aid of new tools in modern probability theory, we proved that the limiting null distribution of the proposed test is normal under mild conditions when p is substantially larger than n. We further study the local power of the proposed test and compare its relative efficiency with a modified Hotelling T2 test for high-dimensional data. An interesting finding is that the newly proposed test can have even more substantial power gain with large p than the traditional nonparametric multivariate test does with finite fixed p. We study the finite sample performance of the proposed test via Monte Carlo simulations. We further illustrate its application by an empirical analysis of a genomics dataset. Supplementary materials for this article are available online.
AB - This work is concerned with testing the population mean vector of nonnormal high-dimensional multivariate data. Several tests for high-dimensional mean vector, based on modifying the classical Hotelling T2 test, have been proposed in the literature. Despite their usefulness, they tend to have unsatisfactory power performance for heavy-tailed multivariate data, which frequently arise in genomics and quantitative finance. This article proposes a novel high-dimensional nonparametric test for the population mean vector for a general class of multivariate distributions. With the aid of new tools in modern probability theory, we proved that the limiting null distribution of the proposed test is normal under mild conditions when p is substantially larger than n. We further study the local power of the proposed test and compare its relative efficiency with a modified Hotelling T2 test for high-dimensional data. An interesting finding is that the newly proposed test can have even more substantial power gain with large p than the traditional nonparametric multivariate test does with finite fixed p. We study the finite sample performance of the proposed test via Monte Carlo simulations. We further illustrate its application by an empirical analysis of a genomics dataset. Supplementary materials for this article are available online.
UR - http://www.scopus.com/inward/record.url?scp=84954418961&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84954418961&partnerID=8YFLogxK
U2 - 10.1080/01621459.2014.988215
DO - 10.1080/01621459.2014.988215
M3 - Article
AN - SCOPUS:84954418961
SN - 0162-1459
VL - 110
SP - 1658
EP - 1669
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 512
ER -