TY - JOUR
T1 - Reduction of sample heterogeneity through use of population substructure
T2 - An example from a population of African American families with sarcoidosis
AU - Thompson, Cheryl L.
AU - Rybicki, Benjamin A.
AU - Iannuzzi, Michael C.
AU - Elston, Robert C.
AU - Iyengar, Sudha K.
AU - Gray-McGuire, Courtney
N1 - Funding Information:
We thank all family members who participated in the SAGA study, and we thank the reviewers for their insightful comments. This study was supported by National Heart, Lung, and Blood Institute training grant HL07567 and National Cancer Institute training grant CA094186, as well as National Institutes of Health grants UO1 HL060263, RO1 GM28356, and P41 RR003655 and National Center for Research Resources grant RR03655.
PY - 2006/10
Y1 - 2006/10
N2 - Sarcoidosis is a granulomatous inflammatory disorder of complex etiology with significant linkage to chromosome 5, and marginal linkage was observed to five other chromosomes in African Americans (AAs) in our previously published genome scan. Because genetic factors underlying complex disease are often population specific, genetic analysis of samples with diverse ancestry (i.e., ethnic confounding) can lead to loss of power. Ethnic confounding is often addressed by stratifying on self-reported race, a controversial and less-than-perfect construct. Here, we propose linkage analysis stratified by genetically determined ancestry as an alternative approach for reducing ethnic confounding. Using data from the 380 microsatellite markers genotyped in the aforementioned genome scan, we clustered AA families into subpopulations on the basis of ancestry similarity. Evidence of two genetically distinct groups was found: subpopulation one (S1) comprised 219 of the 229 families, subpopulation two (S2) consisted of six families (the remaining four families were a mixture). Stratified linkage results suggest that only the S1 families contributed to previously identified linkage signals at 1p22, 3p21-14, 11p15, and 17q21 and that only the S2 families contributed to those found at 5p15-13 and 20q13. Signals on 2p25, 5q11, 5q35, and 9q34 remained significant in both subpopulations, and evidence of a new susceptibility locus at 2q37 was found in S2. These results demonstrate the usefulness of stratifying on genetically determined ancestry, to create genetically homogeneous subsets-more reliable and less controversial than race-stratified subsets-in which to identify genetic factors. Our findings support the presence of sarcoidosis-susceptibility genes in regions identified elsewhere but indicate that these genes are likely to be ancestry specific.
AB - Sarcoidosis is a granulomatous inflammatory disorder of complex etiology with significant linkage to chromosome 5, and marginal linkage was observed to five other chromosomes in African Americans (AAs) in our previously published genome scan. Because genetic factors underlying complex disease are often population specific, genetic analysis of samples with diverse ancestry (i.e., ethnic confounding) can lead to loss of power. Ethnic confounding is often addressed by stratifying on self-reported race, a controversial and less-than-perfect construct. Here, we propose linkage analysis stratified by genetically determined ancestry as an alternative approach for reducing ethnic confounding. Using data from the 380 microsatellite markers genotyped in the aforementioned genome scan, we clustered AA families into subpopulations on the basis of ancestry similarity. Evidence of two genetically distinct groups was found: subpopulation one (S1) comprised 219 of the 229 families, subpopulation two (S2) consisted of six families (the remaining four families were a mixture). Stratified linkage results suggest that only the S1 families contributed to previously identified linkage signals at 1p22, 3p21-14, 11p15, and 17q21 and that only the S2 families contributed to those found at 5p15-13 and 20q13. Signals on 2p25, 5q11, 5q35, and 9q34 remained significant in both subpopulations, and evidence of a new susceptibility locus at 2q37 was found in S2. These results demonstrate the usefulness of stratifying on genetically determined ancestry, to create genetically homogeneous subsets-more reliable and less controversial than race-stratified subsets-in which to identify genetic factors. Our findings support the presence of sarcoidosis-susceptibility genes in regions identified elsewhere but indicate that these genes are likely to be ancestry specific.
UR - http://www.scopus.com/inward/record.url?scp=33748992869&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33748992869&partnerID=8YFLogxK
U2 - 10.1086/507847
DO - 10.1086/507847
M3 - Article
C2 - 16960797
AN - SCOPUS:33748992869
SN - 0002-9297
VL - 79
SP - 606
EP - 613
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 4
ER -