TY - JOUR
T1 - RNA G-Quadruplexes in the model plant species Arabidopsis thaliana
T2 - Prevalence and possible functional roles
AU - Mullen, Melissa A.
AU - Olson, Kalee J.
AU - Dallaire, Paul
AU - Major, Franois
AU - Assmann, Sarah M.
AU - Bevilacqua, Philip C.
N1 - Funding Information:
National Science Foundation (MCB-0527102 to P.C.B.); National Science Foundation (MCB-03-45251 to S.M.A. and P.C.B.); Human Frontier Science Program (HFSP) (RGP0002/2009-C to P.C.B., S.M.A. and F.M.). Funding for open access charge: HFSP.
PY - 2010/12
Y1 - 2010/12
N2 - Tandem stretches of guanines can associate in hydrogen-bonded arrays to form G-quadruplexes, which are stabilized by K+ ions. Using computational methods, we searched for G-Quadruplex Sequence (GQS) patterns in the model plant species Arabidopsis thaliana. We found ∼1200 GQS with a G3 repeat sequence motif, most of which are located in the intergenic region. Using a Markov modeled genome, we determined that GQS are significantly underrepresented in the genome. Additionally, we found ∼43000 GQS with a G2 repeat sequence motif; notably, 80 of these were located in genic regions, suggesting that these sequences may fold at the RNA level. Gene Ontology functional analysis revealed that GQS are overrepresented in genes encoding proteins of certain functional categories, including enzyme activity. Conversely, GQS are underrepresented in other categories of genes, notably those for non-coding RNAs such as tRNAs and rRNAs. We also find that genes that are differentially regulated by drought are significantly more likely to contain a GQS. CD-detected K+ titrations performed on representative RNAs verified formation of quadruplexes at physiological K+ concentrations. Overall, this study indicates that GQS are present at unique locations in Arabidopsis and that folding of RNA GQS may play important roles in regulating gene expression.
AB - Tandem stretches of guanines can associate in hydrogen-bonded arrays to form G-quadruplexes, which are stabilized by K+ ions. Using computational methods, we searched for G-Quadruplex Sequence (GQS) patterns in the model plant species Arabidopsis thaliana. We found ∼1200 GQS with a G3 repeat sequence motif, most of which are located in the intergenic region. Using a Markov modeled genome, we determined that GQS are significantly underrepresented in the genome. Additionally, we found ∼43000 GQS with a G2 repeat sequence motif; notably, 80 of these were located in genic regions, suggesting that these sequences may fold at the RNA level. Gene Ontology functional analysis revealed that GQS are overrepresented in genes encoding proteins of certain functional categories, including enzyme activity. Conversely, GQS are underrepresented in other categories of genes, notably those for non-coding RNAs such as tRNAs and rRNAs. We also find that genes that are differentially regulated by drought are significantly more likely to contain a GQS. CD-detected K+ titrations performed on representative RNAs verified formation of quadruplexes at physiological K+ concentrations. Overall, this study indicates that GQS are present at unique locations in Arabidopsis and that folding of RNA GQS may play important roles in regulating gene expression.
UR - http://www.scopus.com/inward/record.url?scp=78650442734&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78650442734&partnerID=8YFLogxK
U2 - 10.1093/nar/gkq804
DO - 10.1093/nar/gkq804
M3 - Article
C2 - 20860998
AN - SCOPUS:78650442734
SN - 0305-1048
VL - 38
SP - 8149
EP - 8163
JO - Nucleic acids research
JF - Nucleic acids research
IS - 22
ER -