Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads

Laura H. Tung, Mingfu Shao, Carl Kingsford

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Single-molecule long-read sequencing has been used to improve mRNA isoform identification. However, not all single-molecule long reads represent full transcripts due to incomplete cDNA synthesis and sequencing length limits. This drives a need for long-read transcript assembly. By adding long-read-specific optimizations to Scallop, we developed Scallop-LR, a reference-based long-read transcript assembler. Analyzing 26 PacBio samples, we quantified the benefit of performing transcript assembly on long reads. We demonstrate Scallop-LR identifies more known transcripts and potentially novel isoforms for the human transcriptome than Iso-Seq Analysis and StringTie, indicating that long-read transcript assembly by Scallop-LR can reveal a more complete human transcriptome.

Original languageEnglish (US)
Article number287
JournalGenome biology
Volume20
Issue number1
DOIs
StatePublished - Dec 18 2019

All Science Journal Classification (ASJC) codes

  • Ecology, Evolution, Behavior and Systematics
  • Genetics
  • Cell Biology

Fingerprint

Dive into the research topics of 'Quantifying the benefit offered by transcript assembly with Scallop-LR on single-molecule long reads'. Together they form a unique fingerprint.

Cite this