Skip to main navigation Skip to search Skip to main content

Accurate Detection of Tandem Repeats from Error-Prone Sequences with EquiRep

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Several critical tasks in biology such as detecting tandem repeats from error-prone long reads and reconstructing circular RNAs from rolling circle long-reads data, require solving a fundamental computational problem: given a sequence containing an unknown number of mutated copies of an unknown repeat unit, reconstruct the original unit. While several methods exist for this problem, they often exhibit low accuracy when the repeat unit length increases or the number of copies is low. Furthermore, methods capable of handling highly mutated sequences remain scarce, highlighting significant need for improvement. We introduce EquiRep, a tool for accurate detection of tandem repeats from error-prone sequences. By evaluating using simulated and real datasets we show that EquiRep consistently outperforms state-of-the-art methods. EquiRep is robust to sequencing errors, and is able to make better predictions for long units and low frequencies, underscoring its broad usability. EquiRep is freely available at https://github.com/Shao-Group/EquiRep. The full version of this manuscript is available at https://doi.org/10.1101/2024.11.05.621953.

Original languageEnglish (US)
Title of host publicationResearch in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings
EditorsSriram Sankararaman
PublisherSpringer Science and Business Media Deutschland GmbH
Pages390-394
Number of pages5
ISBN (Print)9783031902512
DOIs
StatePublished - 2025
Event29th International Conference on Research in Computational Molecular Biology, RECOMB 2025 - Seoul, Korea, Republic of
Duration: Apr 26 2025Apr 29 2025

Publication series

NameLecture Notes in Computer Science
Volume15647 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Conference on Research in Computational Molecular Biology, RECOMB 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period4/26/254/29/25

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Accurate Detection of Tandem Repeats from Error-Prone Sequences with EquiRep'. Together they form a unique fingerprint.

Cite this