TY - JOUR
T1 - Power of grammatical evolution neural networks to detect gene-gene interactions in the presence of error
AU - Motsinger-Reif, Alison A.
AU - Fanelli, Theresa J.
AU - Davis, Anna C.
AU - Ritchie, Marylyn D.
N1 - Funding Information:
The work for this project was supported by National Institutes of Health grants HL65962, GM62758, and AG20135.
PY - 2008
Y1 - 2008
N2 - Background: With the advent of increasingly efficient means to obtain genetic information, a great insurgence of data has resulted, leading to the need for methods for analyzing this data beyond that of traditional parametric statistical approaches. Recently we introduced Grammatical Evolution Neural Network (GENN), a machine-learning approach to detect gene-gene or gene-environment interactions, also known as epistasis, in high dimensional genetic epidemiological data. GENN has been shown to be highly successful in a range of simulated data, but the impact of error common to real data is unknown. In the current study, we examine the power of GENN to detect interesting interactions in the presence of noise due to genotyping error, missing data, phenocopy, and genetic heterogeneity. Additionally, we compare the performance of GENN to that of another computational method - Multifactor Dimensionality Reduction (MDR). Findings: GENN is extremely robust to missing data and genotyping error. Phenocopy in a dataset reduces the power of both GENN and MDR. GENN is reasonably robust to genetic heterogeneity and find that in some cases GENN has substantially higher power than MDR to detect functional loci in the presence of genetic heterogeneity. Conclusion: GENN is a promising method to detect gene-gene interaction, even in the presence of common types of error found in real data.
AB - Background: With the advent of increasingly efficient means to obtain genetic information, a great insurgence of data has resulted, leading to the need for methods for analyzing this data beyond that of traditional parametric statistical approaches. Recently we introduced Grammatical Evolution Neural Network (GENN), a machine-learning approach to detect gene-gene or gene-environment interactions, also known as epistasis, in high dimensional genetic epidemiological data. GENN has been shown to be highly successful in a range of simulated data, but the impact of error common to real data is unknown. In the current study, we examine the power of GENN to detect interesting interactions in the presence of noise due to genotyping error, missing data, phenocopy, and genetic heterogeneity. Additionally, we compare the performance of GENN to that of another computational method - Multifactor Dimensionality Reduction (MDR). Findings: GENN is extremely robust to missing data and genotyping error. Phenocopy in a dataset reduces the power of both GENN and MDR. GENN is reasonably robust to genetic heterogeneity and find that in some cases GENN has substantially higher power than MDR to detect functional loci in the presence of genetic heterogeneity. Conclusion: GENN is a promising method to detect gene-gene interaction, even in the presence of common types of error found in real data.
UR - http://www.scopus.com/inward/record.url?scp=67650667086&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67650667086&partnerID=8YFLogxK
U2 - 10.1186/1756-0500-1-65
DO - 10.1186/1756-0500-1-65
M3 - Article
C2 - 18710518
AN - SCOPUS:67650667086
SN - 1756-0500
VL - 1
JO - BMC Research Notes
JF - BMC Research Notes
M1 - 65
ER -