TY - JOUR
T1 - A ROBUST CONSISTENT INFORMATION CRITERION FOR MODEL SELECTION BASED ON EMPIRICAL LIKELIHOOD
AU - Chen, Chixiang
AU - Wang, Ming
AU - Wu, Rongling
AU - Li, Runze
N1 - Publisher Copyright:
© 2022 Institute of Statistical Science. All rights reserved.
PY - 2022/7
Y1 - 2022/7
N2 - Conventional likelihood-based information criteria for model selection rely on the assumed distribution of the data. However, for complex data, specifying this underlying distribution turns out to be challenging, and existing criteria may be limited and not sufficiently general to handle various model-selection problems. Here, we propose a robust and consistent model-selection criterion based on the empirical likelihood function, which is data driven. In particular, this framework adopts plug-in estimators that can be achieved by solving external estimating equations not limited to the empirical likelihood. This avoids potential computational-convergence issues and allows for versatile applications, such as generalized linear models, generalized estimating equations, and penalized regressions. The proposed criterion is derived initially from the asymptotic expansion of the marginal likelihood under a variable-selection framework, but more importantly, the consistent model-selection property is established in a general context. Extensive simulation studies confirm that the proposed model-selection criterion outperforms traditional criteria. Finally, an application to the Atherosclerosis Risk in Communities Study illustrates the practical value of the proposed framework.
AB - Conventional likelihood-based information criteria for model selection rely on the assumed distribution of the data. However, for complex data, specifying this underlying distribution turns out to be challenging, and existing criteria may be limited and not sufficiently general to handle various model-selection problems. Here, we propose a robust and consistent model-selection criterion based on the empirical likelihood function, which is data driven. In particular, this framework adopts plug-in estimators that can be achieved by solving external estimating equations not limited to the empirical likelihood. This avoids potential computational-convergence issues and allows for versatile applications, such as generalized linear models, generalized estimating equations, and penalized regressions. The proposed criterion is derived initially from the asymptotic expansion of the marginal likelihood under a variable-selection framework, but more importantly, the consistent model-selection property is established in a general context. Extensive simulation studies confirm that the proposed model-selection criterion outperforms traditional criteria. Finally, an application to the Atherosclerosis Risk in Communities Study illustrates the practical value of the proposed framework.
UR - http://www.scopus.com/inward/record.url?scp=85161725758&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85161725758&partnerID=8YFLogxK
U2 - 10.5705/ss.202020.0254
DO - 10.5705/ss.202020.0254
M3 - Article
AN - SCOPUS:85161725758
SN - 1017-0405
VL - 32
SP - 1205
EP - 1223
JO - Statistica Sinica
JF - Statistica Sinica
IS - 3
ER -