TY - JOUR
T1 - Detection of Shared Balancing Selection in the Absence of Trans-Species Polymorphism
AU - Cheng, Xiaoheng
AU - Degiorgio, Michael
N1 - Funding Information:
We thank Michelle S. Kim for her contribution in testing the draft software for two-species versions of the Ttrans statistics, Javier Prado-Martinez for providing PSMC estimates of great ape demographic history estimates from Prado-Martinez et al. (2013), and Jonathan Terhorst for providing SMC++ human demographic history estimates from Terhorst et al. (2017). We also thank two anonymous reviewers for their constructive feedback that helped strengthen this manuscript. This work was funded by National Institutes of Health (Grant R35GM128590), by the Alfred P. Sloan Foundation, and by Pennsylvania State University startup funds. Portions of this research were conducted with Advanced CyberInfrastructure computational resources provided by the Institute for CyberScience at Pennsylvania State University. This study makes use of the copy number variation data generated by DECIPHER (DatabasE of genomiC varIation and Phenotype in Humans using Ensembl Resources) community, which was funded by Wellcome Trust, and can be accessed at http://decipher.sanger.ac.uk, or via email from decipher@sanger.ac.uk.
Publisher Copyright:
© The Author 2018.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Trans-species polymorphism has been widely used as a key sign of long-term balancing selection across multiple species. However, such sites are often rare in the genome and could result from mutational processes or technical artifacts. Few methods are yet available to specifically detect footprints of trans-species balancing selection without using trans-species polymorphic sites. In this study, we develop summary- and model-based approaches that are each specifically tailored to uncover regions of long-term balancing selection shared by a set of species by using genomic patterns of intraspecific polymorphism and interspecific fixed differences. We demonstrate that our trans-species statistics have substantially higher power than single-species approaches to detect footprints of trans-species balancing selection, and are robust to those that do not affect all tested species. We further apply our model-based methods to human and chimpanzee whole-genome sequencing data. In addition to the previously established major histocompatibility complex and malaria resistance-associated FREM3/GYPE regions, we also find outstanding genomic regions involved in barrier integrity and innate immunity, such as the GRIK1/CLDN17 intergenic region, and the SLC35F1 and ABCA13 genes. Our findings not only echo the significance of pathogen defense but also reveal novel candidates in maintaining balanced polymorphisms across human and chimpanzee lineages. Finally, we show that these trans-species statistics can be applied to and work well for an arbitrary number of species, and integrate them into open-source software packages for ease of use by the scientific community.
AB - Trans-species polymorphism has been widely used as a key sign of long-term balancing selection across multiple species. However, such sites are often rare in the genome and could result from mutational processes or technical artifacts. Few methods are yet available to specifically detect footprints of trans-species balancing selection without using trans-species polymorphic sites. In this study, we develop summary- and model-based approaches that are each specifically tailored to uncover regions of long-term balancing selection shared by a set of species by using genomic patterns of intraspecific polymorphism and interspecific fixed differences. We demonstrate that our trans-species statistics have substantially higher power than single-species approaches to detect footprints of trans-species balancing selection, and are robust to those that do not affect all tested species. We further apply our model-based methods to human and chimpanzee whole-genome sequencing data. In addition to the previously established major histocompatibility complex and malaria resistance-associated FREM3/GYPE regions, we also find outstanding genomic regions involved in barrier integrity and innate immunity, such as the GRIK1/CLDN17 intergenic region, and the SLC35F1 and ABCA13 genes. Our findings not only echo the significance of pathogen defense but also reveal novel candidates in maintaining balanced polymorphisms across human and chimpanzee lineages. Finally, we show that these trans-species statistics can be applied to and work well for an arbitrary number of species, and integrate them into open-source software packages for ease of use by the scientific community.
UR - http://www.scopus.com/inward/record.url?scp=85060390837&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060390837&partnerID=8YFLogxK
U2 - 10.1093/molbev/msy202
DO - 10.1093/molbev/msy202
M3 - Article
C2 - 30380122
AN - SCOPUS:85060390837
SN - 0737-4038
VL - 36
SP - 177
EP - 199
JO - Molecular biology and evolution
JF - Molecular biology and evolution
IS - 1
ER -