Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions

Gustavo de los Campos, Yogasudha Veturi, Ana I. Vazquez, Christina Lehermeier, Paulino Pérez-Rodríguez

Research output: Contribution to journalArticlepeer-review

33 Scopus citations

Abstract

Naturally and artificially selected populations usually exhibit some degree of stratification. In Genome-Wide Association Studies and in Whole-Genome Regressions (WGR) analyses, population stratification has been either ignored or dealt with as a potential confounder. However, systematic differences in allele frequency and in patterns of linkage disequilibrium can induce sub-population-specific effects. From this perspective, structure acts as an effect modifier rather than as a confounder. In this article, we extend WGR models commonly used in plant and animal breeding to allow for sub-population-specific effects. This is achieved by decomposing marker effects into main effects and interaction components that describe group-specific deviations. The model can be used both with variable selection and shrinkage methods and can be implemented using existing software for genomic selection. Using a wheat and a pig breeding data set, we compare parameter estimates and the prediction accuracy of the interaction WGR model with WGR analysis ignoring population stratification (across-group analysis) and with a stratified (i.e., within-sub-population) WGR analysis. The interaction model renders trait-specific estimates of the average correlation of effects between sub-populations; we find that such correlation not only depends on the extent of genetic differentiation in allele frequencies between groups but also varies among traits. The evaluation of prediction accuracy shows a modest superiority of the interaction model relative to the other two approaches. This superiority is the result of better stability in performance of the interaction models across data sets and traits; indeed, in almost all cases, the interaction model was either the best performing model or it performed close to the best performing model.

Original languageEnglish (US)
Pages (from-to)467-490
Number of pages24
JournalJournal of Agricultural, Biological, and Environmental Statistics
Volume20
Issue number4
DOIs
StatePublished - Dec 1 2015

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • General Environmental Science
  • Agricultural and Biological Sciences (miscellaneous)
  • General Agricultural and Biological Sciences
  • Statistics, Probability and Uncertainty
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Incorporating Genetic Heterogeneity in Whole-Genome Regressions Using Interactions'. Together they form a unique fingerprint.

Cite this