TY - JOUR
T1 - Dimension reduction for the conditional mean in regressions with categorical predictors
AU - Li, Bing
AU - Cook, R. Dennis
AU - Chiaromonte, Francesca
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2003/10
Y1 - 2003/10
N2 - Consider the regression of a response Y. on a vector of quantitative predictors X and a categorical predictor W. In this article we describe a first method for reducing the dimension of X without loss of information on the conditional mean E(Y|X, W) and without requiring a prespecified parametric model. The method, which allows for, but does not require, parametric versions of the subpopulation mean functions E(Y|X, W = w), includes a procedure for inference about the dimension of X after reduction. This work integrates previous studies on dimension reduction for the conditional mean E(Y|X) in the absence of categorical predictors and dimension reduction for the full conditional distribution of Y|(X, W). The methodology we describe may be particularly useful for constructing low-dimensional summary plots to aid in model-building at the outset of an analysis. Our proposals provide an often parsimonious alternative to the standard technique of modeling with interaction terms to adapt a mean function for different subpopulations determined by the levels of W. Examples illustrating this and other aspects of the development are presented.
AB - Consider the regression of a response Y. on a vector of quantitative predictors X and a categorical predictor W. In this article we describe a first method for reducing the dimension of X without loss of information on the conditional mean E(Y|X, W) and without requiring a prespecified parametric model. The method, which allows for, but does not require, parametric versions of the subpopulation mean functions E(Y|X, W = w), includes a procedure for inference about the dimension of X after reduction. This work integrates previous studies on dimension reduction for the conditional mean E(Y|X) in the absence of categorical predictors and dimension reduction for the full conditional distribution of Y|(X, W). The methodology we describe may be particularly useful for constructing low-dimensional summary plots to aid in model-building at the outset of an analysis. Our proposals provide an often parsimonious alternative to the standard technique of modeling with interaction terms to adapt a mean function for different subpopulations determined by the levels of W. Examples illustrating this and other aspects of the development are presented.
UR - http://www.scopus.com/inward/record.url?scp=0242679764&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0242679764&partnerID=8YFLogxK
U2 - 10.1214/aos/1065705121
DO - 10.1214/aos/1065705121
M3 - Article
AN - SCOPUS:0242679764
SN - 0090-5364
VL - 31
SP - 1636
EP - 1668
JO - Annals of Statistics
JF - Annals of Statistics
IS - 5
ER -