TY - JOUR
T1 - Inference of the distribution of selection coefficients for new nonsynonymous mutations using large samples
AU - Kim, Bernard Y.
AU - Huber, Christian D.
AU - Lohmueller, Kirk E.
N1 - Publisher Copyright:
© 2017 by the Genetics Society of America.
PY - 2017/5
Y1 - 2017/5
N2 - The distribution of fitness effects (DFE) has considerable importance in population genetics. To date, estimates of the DFE come from studies using a small number of individuals. Thus, estimates of the proportion of moderately to strongly deleterious new mutations may be unreliable because such variants are unlikely to be segregating in the data. Additionally, the true functional form of the DFE is unknown, and estimates of the DFE differ significantly between studies. Here we present a flexible and computationally tractable method, called Fit∂a∂i, to estimate the DFE of new mutations using the site frequency spectrum from a large number of individuals. We apply our approach to the frequency spectrum of 1300 Europeans from the Exome Sequencing Project ESP6400 data set, 1298 Danes from the LuCamp data set, and 432 Europeans from the 1000 Genomes Project to estimate the DFE of deleterious nonsynonymous mutations. We infer significantly fewer (0.38-0.84 fold) strongly deleterious mutations with selection coefficient |s| < 0.01 and more (1.24-1.43 fold) weakly deleterious mutations with selection coefficient |s| <0.001 compared to previous estimates. Furthermore, a DFE that is a mixture distribution of a point mass at neutrality plus a gamma distribution fits better than a gamma distribution in two of the three data sets. Our results suggest that nearly neutral forces play a larger role in human evolution than previously thought.
AB - The distribution of fitness effects (DFE) has considerable importance in population genetics. To date, estimates of the DFE come from studies using a small number of individuals. Thus, estimates of the proportion of moderately to strongly deleterious new mutations may be unreliable because such variants are unlikely to be segregating in the data. Additionally, the true functional form of the DFE is unknown, and estimates of the DFE differ significantly between studies. Here we present a flexible and computationally tractable method, called Fit∂a∂i, to estimate the DFE of new mutations using the site frequency spectrum from a large number of individuals. We apply our approach to the frequency spectrum of 1300 Europeans from the Exome Sequencing Project ESP6400 data set, 1298 Danes from the LuCamp data set, and 432 Europeans from the 1000 Genomes Project to estimate the DFE of deleterious nonsynonymous mutations. We infer significantly fewer (0.38-0.84 fold) strongly deleterious mutations with selection coefficient |s| < 0.01 and more (1.24-1.43 fold) weakly deleterious mutations with selection coefficient |s| <0.001 compared to previous estimates. Furthermore, a DFE that is a mixture distribution of a point mass at neutrality plus a gamma distribution fits better than a gamma distribution in two of the three data sets. Our results suggest that nearly neutral forces play a larger role in human evolution than previously thought.
UR - http://www.scopus.com/inward/record.url?scp=85020727089&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85020727089&partnerID=8YFLogxK
U2 - 10.1534/genetics.116.197145
DO - 10.1534/genetics.116.197145
M3 - Article
C2 - 28249985
AN - SCOPUS:85020727089
SN - 0016-6731
VL - 206
SP - 345
EP - 361
JO - Genetics
JF - Genetics
IS - 1
ER -