Abstract
Large-scale sequence-based association analysis is a powerful approach to identify rare variants involved in complex trait etiologies. Confirmation of significant findings in stage 1 through replication in an independent stage 2 sample is necessary to avoid reporting spurious results. For gene-based mapping of rare variants, where rare variants within a region are analyzed in aggregate, three replication strategies are possible: (1) variant-based replication, wherein only variants from nucleotide sites uncovered in stage 1 within the gene region are genotyped and followed up; (2) sequence-based replication, wherein the gene region is sequenced in the replication sample and both known and novel variants are tested; and (3) exome-array-based replication, where the identified gene region in the stage 1 sample is followed up using exome arrays in the stage 2 sample. The efficiency of the three strategies is dependent on the proportions of causative variants discovered in stage 1, sequencing/genotyping errors, trait-specific genetic architecture, as well as how many variants within the identified gene region are available for genotyping on the exome array. With rigorous population genetic and phenotypic models, it is demonstrated that sequence-based replication is consistently more powerful than variant- and exome-array-based replication, although the power gain can be small. For variant-based replication, if the stage 1 sample consists of several thousands of individuals, a large fraction of causative variant sites can be observed, and even for smaller stage 1 studies, a large proportion of the locus population attributable risk can be explained by the uncovered variants. Exome-array-based replication can have comparable power to the other two approaches if coding variants driving the association are well represented. As a consequence, although sequence-based replication is usually more powerful and also valuable to identify novel potentially causal variants, both variant- and exome-array-based replication can be a viable and cost-effective approach for replicating rare variant associations.
Original language | English (US) |
---|---|
Title of host publication | Assessing Rare Variation in Complex Traits |
Subtitle of host publication | Design and Analysis of Genetic Studies |
Publisher | Springer New York |
Pages | 201-213 |
Number of pages | 13 |
ISBN (Electronic) | 9781493928248 |
ISBN (Print) | 9781493928231 |
DOIs | |
State | Published - Jan 1 2015 |
All Science Journal Classification (ASJC) codes
- General Medicine
- General Biochemistry, Genetics and Molecular Biology