TY - JOUR
T1 - Genetic programming neural networks
T2 - A powerful bioinformatics tool for human genetics
AU - Ritchie, Marylyn D.
AU - Motsinger, Alison A.
AU - Bush, William S.
AU - Coffey, Christopher S.
AU - Moore, Jason H.
N1 - Funding Information:
This work was supported by National Institutes of Health grants HL65234, HL65962, GM31304, AG19085, AG20135, AI59694, HD047447, and LM007450.
PY - 2007/1
Y1 - 2007/1
N2 - The identification of genes that influence the risk of common, complex disease primarily through interactions with other genes and environmental factors remains a statistical and computational challenge in genetic epidemiology. This challenge is partly due to the limitations of parametric statistical methods for detecting genetic effects that are dependent solely or partially on interactions. We have previously introduced a genetic programming neural network (GPNN) as a method for optimizing the architecture of a neural network to improve the identification of genetic and gene-environment combinations associated with disease risk. Previous empirical studies suggest GPNN has excellent power for identifying gene-gene and gene-environment interactions. The goal of this study was to compare the power of GPNN to stepwise logistic regression (SLR) and classification and regression trees (CART) for identifying gene-gene and gene-environment interactions. SLR and CART are standard methods of analysis for genetic association studies. Using simulated data, we show that GPNN has higher power to identify gene-gene and gene-environment interactions than SLR and CART. These results indicate that GPNN may be a useful pattern recognition approach for detecting gene-gene and gene-environment interactions in studies of human disease.
AB - The identification of genes that influence the risk of common, complex disease primarily through interactions with other genes and environmental factors remains a statistical and computational challenge in genetic epidemiology. This challenge is partly due to the limitations of parametric statistical methods for detecting genetic effects that are dependent solely or partially on interactions. We have previously introduced a genetic programming neural network (GPNN) as a method for optimizing the architecture of a neural network to improve the identification of genetic and gene-environment combinations associated with disease risk. Previous empirical studies suggest GPNN has excellent power for identifying gene-gene and gene-environment interactions. The goal of this study was to compare the power of GPNN to stepwise logistic regression (SLR) and classification and regression trees (CART) for identifying gene-gene and gene-environment interactions. SLR and CART are standard methods of analysis for genetic association studies. Using simulated data, we show that GPNN has higher power to identify gene-gene and gene-environment interactions than SLR and CART. These results indicate that GPNN may be a useful pattern recognition approach for detecting gene-gene and gene-environment interactions in studies of human disease.
UR - http://www.scopus.com/inward/record.url?scp=33750935113&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33750935113&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2006.01.013
DO - 10.1016/j.asoc.2006.01.013
M3 - Article
C2 - 20948988
AN - SCOPUS:33750935113
SN - 1568-4946
VL - 7
SP - 471
EP - 479
JO - Applied Soft Computing Journal
JF - Applied Soft Computing Journal
IS - 1
ER -