TY - JOUR
T1 - Variable Screening via Quantile Partial Correlation
AU - Ma, Shujie
AU - Li, Runze
AU - Tsai, Chih Ling
N1 - Publisher Copyright:
© 2017 American Statistical Association.
PY - 2017/4/3
Y1 - 2017/4/3
N2 - In quantile linear regression with ultrahigh-dimensional data, we propose an algorithm for screening all candidate variables and subsequently selecting relevant predictors. Specifically, we first employ quantile partial correlation for screening, and then we apply the extended Bayesian information criterion (EBIC) for best subset selection. Our proposed method can successfully select predictors when the variables are highly correlated, and it can also identify variables that make a contribution to the conditional quantiles but are marginally uncorrelated or weakly correlated with the response. Theoretical results show that the proposed algorithm can yield the sure screening set. By controlling the false selection rate, model selection consistency can be achieved theoretically. In practice, we proposed using EBIC for best subset selection so that the resulting model is screening consistent. Simulation studies demonstrate that the proposed algorithm performs well, and an empirical example is presented. Supplementary materials for this article are available online.
AB - In quantile linear regression with ultrahigh-dimensional data, we propose an algorithm for screening all candidate variables and subsequently selecting relevant predictors. Specifically, we first employ quantile partial correlation for screening, and then we apply the extended Bayesian information criterion (EBIC) for best subset selection. Our proposed method can successfully select predictors when the variables are highly correlated, and it can also identify variables that make a contribution to the conditional quantiles but are marginally uncorrelated or weakly correlated with the response. Theoretical results show that the proposed algorithm can yield the sure screening set. By controlling the false selection rate, model selection consistency can be achieved theoretically. In practice, we proposed using EBIC for best subset selection so that the resulting model is screening consistent. Simulation studies demonstrate that the proposed algorithm performs well, and an empirical example is presented. Supplementary materials for this article are available online.
UR - http://www.scopus.com/inward/record.url?scp=85016450688&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85016450688&partnerID=8YFLogxK
U2 - 10.1080/01621459.2016.1156545
DO - 10.1080/01621459.2016.1156545
M3 - Article
C2 - 28943683
AN - SCOPUS:85016450688
SN - 0162-1459
VL - 112
SP - 650
EP - 663
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 518
ER -