CloudSPAdes: Assembly of synthetic long reads using de Bruijn graphs

Ivan Tolstoganov, Anton Bankevich, Zhoutao Chen, Pavel A. Pevzner

Research output: Contribution to journal › Article › peer-review

21 Scopus citations

Abstract

Motivation: The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly.

Results: We describe the algorithmic challenge of the SLR assembly and present a cloudSPAdes algorithm for SLR assembly that is based on analyzing the de Bruijn graph of SLRs. We benchmarked cloudSPAdes across various barcoding technologies/applications and demonstrated that it improves on the state-of-the-art SLR assemblers in accuracy and speed.

Original language: English (US)
Article number: btz349
Pages (from-to): i61-i70
Journal: Bioinformatics
Volume: 35
Issue number: 14
State: Published - Jul 15 2019

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics
