TY - JOUR
T1 - Ribosomal DNA arrays are the most H-DNA rich element in the human genome
AU - Chantzi, Nikol
AU - Chan, Candace S.Y.
AU - Patsakis, Michail
AU - Nayak, Akshatha
AU - Montgomery, Austin
AU - Mouratidis, Ioannis
AU - Georgakopoulos-Soares, Ilias
N1 - Publisher Copyright:
© 2025 The Author(s).
PY - 2025/3/1
Y1 - 2025/3/1
N2 - Repetitive DNA sequences can form noncanonical structures such as H-DNA. The new telomere-to-telomere genome assembly for the human genome has eliminated gaps, enabling examination of highly repetitive regions including centromeric and pericentromeric repeats and ribosomal DNA arrays. We find that H-DNA appears once every 25 000 base pairs in the human genome. Its distribution is highly inhomogeneous with H-DNA motif hotspots being detectable in acrocentric chromosomes. Ribosomal DNA arrays are the genomic element with a 40.94-fold H-DNA enrichment. Across acrocentric chromosomes, we report that 54.82% of H-DNA motifs found in these chromosomes are in rDNA array loci. We discover that binding sites for the PRDM9-B allele, a variant of the PRDM9 protein, are enriched for H-DNA motifs. We further investigate these findings through an analysis of PRDM-9 ChIP-seq data across various PRDM-9 alleles, observing an enrichment of H-DNA motifs in the binding sites of A-like alleles (including A, B, and N alleles), but not C-like alleles (including C and L4 alleles). The enrichment of H-DNA motifs at ribosomal DNA arrays is consistent in nonhuman great ape genomes. We conclude that ribosomal DNA arrays are the most enriched genomic loci for H-DNA sequences in human and other great ape genomes.
AB - Repetitive DNA sequences can form noncanonical structures such as H-DNA. The new telomere-to-telomere genome assembly for the human genome has eliminated gaps, enabling examination of highly repetitive regions including centromeric and pericentromeric repeats and ribosomal DNA arrays. We find that H-DNA appears once every 25 000 base pairs in the human genome. Its distribution is highly inhomogeneous with H-DNA motif hotspots being detectable in acrocentric chromosomes. Ribosomal DNA arrays are the genomic element with a 40.94-fold H-DNA enrichment. Across acrocentric chromosomes, we report that 54.82% of H-DNA motifs found in these chromosomes are in rDNA array loci. We discover that binding sites for the PRDM9-B allele, a variant of the PRDM9 protein, are enriched for H-DNA motifs. We further investigate these findings through an analysis of PRDM-9 ChIP-seq data across various PRDM-9 alleles, observing an enrichment of H-DNA motifs in the binding sites of A-like alleles (including A, B, and N alleles), but not C-like alleles (including C and L4 alleles). The enrichment of H-DNA motifs at ribosomal DNA arrays is consistent in nonhuman great ape genomes. We conclude that ribosomal DNA arrays are the most enriched genomic loci for H-DNA sequences in human and other great ape genomes.
UR - http://www.scopus.com/inward/record.url?scp=86000614827&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=86000614827&partnerID=8YFLogxK
U2 - 10.1093/nargab/lqaf012
DO - 10.1093/nargab/lqaf012
M3 - Article
C2 - 40041207
AN - SCOPUS:86000614827
SN - 2631-9268
VL - 7
JO - NAR Genomics and Bioinformatics
JF - NAR Genomics and Bioinformatics
IS - 1
M1 - lqaf012
ER -