TY - JOUR
T1 - Enabling high-throughput genotype-phenotype associations in the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project as part of the Population Architecture using Genomics and Epidemiology (PAGE) study
AU - Bush, William S.
AU - Boston, Jonathan
AU - Pendergrass, Sarah A.
AU - Dumitrescu, Logan
AU - Goodloe, Robert
AU - Brown-Gentry, Kristin
AU - Wilson, Sarah
AU - McClellan, Bob
AU - Torstenson, Eric
AU - Basford, Melissa A.
AU - Spencer, Kylee L.
AU - Ritchie, Marylyn D.
AU - Crawford, Dana C.
PY - 2013
Y1 - 2013
N2 - Genetic association studies have rapidly become a major tool for identifying the genetic basis of common human diseases. The advent of cost-effective genotyping coupled with large collections of samples linked to clinical outcomes and quantitative traits now make it possible to systematically characterize genotype-phenotype relationships in diverse populations and extensive datasets. To capitalize on these advancements, the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project, as part of the collaborative Population Architecture using Genomics and Epidemiology (PAGE) study, accesses two collections: the National Health and Nutrition Examination Surveys (NHANES) and BioVU, Vanderbilt University's biorepository linked to de-identified electronic medical records. We describe herein the workflows for accessing and using the epidemiologic (NHANES) and clinical (BioVU) collections, where each workflow has been customized to reflect the content and data access limitations of each respective source. We also describe the process by which these data are generated, standardized, and shared for metaanalysis among the PAGE study sites. As a specific example of the use of BioVU, we describe the data mining efforts to define cases and controls for genetic association studies of common cancers in PAGE. Collectively, the efforts described here are a generalized outline for many of the successful approaches that can be used in the era of high-throughput genotype-phenotype associations for moving biomedical discovery forward to new frontiers of data generation and analysis.
AB - Genetic association studies have rapidly become a major tool for identifying the genetic basis of common human diseases. The advent of cost-effective genotyping coupled with large collections of samples linked to clinical outcomes and quantitative traits now make it possible to systematically characterize genotype-phenotype relationships in diverse populations and extensive datasets. To capitalize on these advancements, the Epidemiologic Architecture for Genes Linked to Environment (EAGLE) project, as part of the collaborative Population Architecture using Genomics and Epidemiology (PAGE) study, accesses two collections: the National Health and Nutrition Examination Surveys (NHANES) and BioVU, Vanderbilt University's biorepository linked to de-identified electronic medical records. We describe herein the workflows for accessing and using the epidemiologic (NHANES) and clinical (BioVU) collections, where each workflow has been customized to reflect the content and data access limitations of each respective source. We also describe the process by which these data are generated, standardized, and shared for metaanalysis among the PAGE study sites. As a specific example of the use of BioVU, we describe the data mining efforts to define cases and controls for genetic association studies of common cancers in PAGE. Collectively, the efforts described here are a generalized outline for many of the successful approaches that can be used in the era of high-throughput genotype-phenotype associations for moving biomedical discovery forward to new frontiers of data generation and analysis.
UR - http://www.scopus.com/inward/record.url?scp=84891474935&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84891474935&partnerID=8YFLogxK
M3 - Conference article
C2 - 23424142
AN - SCOPUS:84891474935
SN - 2335-6928
SP - 373
EP - 384
JO - Pacific Symposium on Biocomputing
JF - Pacific Symposium on Biocomputing
T2 - 18th Pacific Symposium on Biocomputing, PSB 2013
Y2 - 3 January 2013 through 7 January 2013
ER -