TY - JOUR
T1 - Quadrupia provides a comprehensive catalog of G-quadruplexes across genomes from the tree of life
AU - Chantzi, Nikol
AU - Nayak, Akshatha
AU - Baltoumas, Fotis A.
AU - Aplakidou, Eleni
AU - Liew, Shiau Wei
AU - Galuh, Jesslyn Elvaretta
AU - Patsakis, Michail
AU - Montgomery, Austin
AU - Moeckel, Camille
AU - Mouratidis, Ioannis
AU - Sazed, Saiful Arefeen
AU - Guiblet, Wilfried
AU - Karmiris-Obratański, Panagiotis
AU - Wang, Guliang
AU - Zaravinos, Apostolos
AU - Vasquez, Karen M.
AU - Kwok, Chun Kit
AU - Pavlopoulos, Georgios A.
AU - Georgakopoulos-Soares, Ilias
N1 - Publisher Copyright:
© 2025 Chantzi et al.
PY - 2025/11
Y1 - 2025/11
N2 - G-quadruplex DNA structures exhibit a profound influence on essential biological processes, including transcription, replication, telomere maintenance, and genomic stability. These structures have demonstrably shaped organismal evolution. However, a comprehensive, organism-wide G-quadruplex map encompassing the diversity of life has remained elusive. Here, we introduce Quadrupia, the most extensive and well-characterized G-quadruplex database to date, facilitating the exploration of G-quadruplex structures across the evolutionary spectrum. Quadrupia has identified G-quadruplex sequences in 108,449 reference genomes, with a total of 140,181,277 G-quadruplexes. The database also hosts a collection of 319,784 G-quadruplex clusters of 20 or more members, annotated by taxonomic distributions, multiple sequence alignments, profile hidden Markov models, and cross-references to G-quadruplex 3D structures. Examination of G-quadruplexes across functional genomic elements in different taxa indicates preferential orientation and positioning, with significant differences between individual taxonomic groups. For example, we find that G-quadruplexes in bacteria with a single replication origin display profound preference for the leading orientation. Finally, we experimentally validate the most frequently observed G-quadruplexes using CD-spectroscopy, UV melting, and fluorescent-based approaches.
AB - G-quadruplex DNA structures exhibit a profound influence on essential biological processes, including transcription, replication, telomere maintenance, and genomic stability. These structures have demonstrably shaped organismal evolution. However, a comprehensive, organism-wide G-quadruplex map encompassing the diversity of life has remained elusive. Here, we introduce Quadrupia, the most extensive and well-characterized G-quadruplex database to date, facilitating the exploration of G-quadruplex structures across the evolutionary spectrum. Quadrupia has identified G-quadruplex sequences in 108,449 reference genomes, with a total of 140,181,277 G-quadruplexes. The database also hosts a collection of 319,784 G-quadruplex clusters of 20 or more members, annotated by taxonomic distributions, multiple sequence alignments, profile hidden Markov models, and cross-references to G-quadruplex 3D structures. Examination of G-quadruplexes across functional genomic elements in different taxa indicates preferential orientation and positioning, with significant differences between individual taxonomic groups. For example, we find that G-quadruplexes in bacteria with a single replication origin display profound preference for the leading orientation. Finally, we experimentally validate the most frequently observed G-quadruplexes using CD-spectroscopy, UV melting, and fluorescent-based approaches.
UR - https://www.scopus.com/pages/publications/105020743982
UR - https://www.scopus.com/pages/publications/105020743982#tab=citedBy
U2 - 10.1101/gr.279790.124
DO - 10.1101/gr.279790.124
M3 - Article
C2 - 40858360
AN - SCOPUS:105020743982
SN - 1088-9051
VL - 35
SP - 2578
EP - 2600
JO - Genome research
JF - Genome research
IS - 11
ER -