Efficient estimation of population-level summaries in general semiparametric regression models

Arnab Maity, Yanyuan Ma, Raymond J. Carroll

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

This article considers a wide class of semiparametric regression models in which interest focuses on population-level quantities that combine both the parametric and the nonparametric parts of the model. Special cases in this approach include generalized partially linear models, generalized partially linear single-index models, structural measurement error models, and many others. For estimating the parametric part of the model efficiently, profile likelihood kernel estimation methods are well established in the literature. Here our focus is on estimating general population-level quantities that combine the parametric and nonparametric parts of the model (e.g., population mean, probabilities, etc.). We place this problem in a general context, provide a general kernel-based methodology, and derive the asymptotic distributions of estimates of these population-level quantities, showing that in many cases the estimates are semiparametric efficient. For estimating the population mean with no missing data, we show that the sample mean is semiparametric efficient for canonical exponential families, but not in general. We apply the methods to a problem in nutritional epidemiology, where estimating the distribution of usual intake is of primary interest and semiparametric methods are not available. Extensions to the case of missing response data are also discussed.

Original languageEnglish (US)
Pages (from-to)123-139
Number of pages17
JournalJournal of the American Statistical Association
Volume102
Issue number477
DOIs
StatePublished - Mar 2007

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Efficient estimation of population-level summaries in general semiparametric regression models'. Together they form a unique fingerprint.

Cite this