Analysis pipeline for the epistasis search - statistical versus biological filtering

Xiangqing Sun, Qing Lu, Shubhabrata Mukheerjee, Paul K. Crane, Robert Elston, Marylyn D. Ritchie

Research output: Contribution to journalShort surveypeer-review

52 Scopus citations


Gene-gene interactions may contribute to the genetic variation underlying complex traits but have not always been taken fully into account. Statistical analyses that consider gene-gene interaction may increase the power of detecting associations, especially for low-marginal-effect markers, and may explain in part the "missing heritability." Detecting pair-wise and higher-order interactions genome-wide requires enormous computational power. Filtering pipelines increase the computational speed by limiting the number of tests performed. We summarize existing filtering approaches to detect epistasis, after distinguishing the purposes that lead us to search for epistasis. Statistical filtering includes quality control on the basis of single marker statistics to avoid the analysis of bad and least informative data, and limits the search space for finding interactions. Biological filtering includes targeting specific pathways, integrating various databases based on known biological and metabolic pathways, gene function ontology and protein-protein interactions. It is increasingly possible to target single-nucleotide polymorphisms that have defined functions on gene expression, though not belonging to protein-coding genes. Filtering can improve the power of an interaction association study, but also increases the chance of missing important findings.

Original languageEnglish (US)
Article numberArticle 106
JournalFrontiers in Genetics
Issue numberAPR
StatePublished - 2014

All Science Journal Classification (ASJC) codes

  • Molecular Medicine
  • Genetics
  • Genetics(clinical)


Dive into the research topics of 'Analysis pipeline for the epistasis search - statistical versus biological filtering'. Together they form a unique fingerprint.

Cite this