Drosophila Evolution over Space and Time (DEST): A New Population Genomics Resource

Martin Kapun, Joaquin C.B. Nunez, María Bogaerts-Márquez, Jesús Murga-Moreno, Margot Paris, Joseph Outten, Marta Coronado-Zamora, Courtney Tern, Omar Rota-Stabelli, Maria P. García Guerreiro, Sònia Casillas, Dorcas J. Orengo, Eva Puerma, Maaria Kankare, Lino Ometto, Volker Loeschcke, Banu S. Onder, Jessica K. Abbott, Stephen W. Schaeffer, Subhash Rajpurohit, Emily L. Behrman, Mads F. Schou, Thomas J.S. Merritt, Brian P. Lazzaro, Amanda Glaser-Schmitt, Eliza Argyridou, Fabian Staubach, Yun Wang, Eran Tauber, Svitlana V. Serga, Daniel K. Fabian, Kelly A. Dyer, Christopher W. Wheat, John Parsch, Sonja Grath, Marija Savic Veselinovic, Marina Stamenkovic-Radak, Mihailo Jelic, Antonio J. Buendía-Ruíz, Maria Josefa Gómez-Julián, Maria Luisa Espinosa-Jimenez, Francisco D. Gallardo-Jiménez, Aleksandra Patenkovic, Katarina Eric, Marija Tanaskovic, Anna Ullastres, Lain Guio, Miriam Merenciano, Sara Guirao-Rico, Vivien Horváth, Darren J. Obbard, Elena Pasyukova, Vladimir E. Alatortsev, Cristina P. Vieira, Jorge Vieira, Jorge Roberto Torres, Iryna Kozeretska, Oleksandr M. Maistrenko, Catherine Montchamp-Moreau, Dmitry V. Mukha, Heather E. Machado, Keric Lamb, Tânia Paulo, Leeban Yusuf, Antonio Barbadilla, Dmitri Petrov, Paul Schmidt, Josefa Gonzalez, Thomas Flatt, Alan O. Bergland

Research output: Contribution to journal, Article, peer-review

28 Scopus citations

Abstract

Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the past few years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
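
As a rough illustration of what heuristic allele-frequency estimation from Pool-Seq read counts can look like, the sketch below computes an alternate-allele frequency at a single site after applying simple coverage and allele-count filters. It is not the PoolSNP or SNAPE-pooled algorithm; the names `PoolSite` and `allele_frequency` and the thresholds `min_cov` and `min_alt_count` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PoolSite:
    """Read counts at one biallelic site in one pooled sample (hypothetical type)."""
    ref_count: int  # reads supporting the reference allele
    alt_count: int  # reads supporting the alternate allele


def allele_frequency(site: PoolSite,
                     min_cov: int = 10,
                     min_alt_count: int = 2) -> Optional[float]:
    """Return the alternate-allele frequency, or None if coverage is too low.

    min_cov and min_alt_count are illustrative thresholds, not values
    taken from the DEST pipeline.
    """
    depth = site.ref_count + site.alt_count
    if depth < min_cov:
        return None  # too few reads to estimate a frequency
    if site.alt_count < min_alt_count:
        return 0.0   # treat very rare alternate reads as likely errors
    return site.alt_count / depth


if __name__ == "__main__":
    print(allele_frequency(PoolSite(ref_count=30, alt_count=10)))
```

A probabilistic caller would instead model sequencing error and pool size explicitly; this sketch only shows the count-and-threshold idea.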

Original language: English (US)
Pages (from-to): 5782-5805
Number of pages: 24
Journal: Molecular Biology and Evolution
Volume: 38
Issue number: 12
State: Published - 2021

All Science Journal Classification (ASJC) codes

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics
