TY - JOUR
T1 - Modeling RNA secondary structure folding ensembles using SHAPE mapping data
AU - Spasic, Aleksandar
AU - Assmann, Sarah M.
AU - Bevilacqua, Philip C.
AU - Mathews, David H.
N1 - Publisher Copyright:
© The Author(s) 2017.
PY - 2018/1/9
Y1 - 2018/1/9
N2 - RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.
AB - RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.
UR - http://www.scopus.com/inward/record.url?scp=85045194833&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85045194833&partnerID=8YFLogxK
U2 - 10.1093/nar/gkx1057
DO - 10.1093/nar/gkx1057
M3 - Article
C2 - 29177466
AN - SCOPUS:85045194833
SN - 0305-1048
VL - 46
SP - 314
EP - 323
JO - Nucleic acids research
JF - Nucleic acids research
IS - 1
ER -