TY - JOUR
T1 - Predicting 3D RNA structure from the nucleotide sequence using Euclidean neural networks
AU - Sha, Congzhou M.
AU - Wang, Jian
AU - Dokholyan, Nikolay V.
N1 - Publisher Copyright:
© 2023 Biophysical Society
PY - 2024/9/3
Y1 - 2024/9/3
N2 - Fast and accurate 3D RNA structure prediction remains a major challenge in structural biology, mostly due to the size and flexibility of RNA molecules, as well as the lack of diverse experimentally determined structures of RNA molecules. Unlike DNA structure, RNA structure is far less constrained by basepair hydrogen bonding, resulting in an explosion of potential stable states. Here, we propose a convolutional neural network that predicts all pairwise distances between residues in an RNA, using a recently described smooth parametrization of Euclidean distance matrices. We achieve high-accuracy predictions on RNAs up to 100 nt in length in fractions of a second, a factor of 107 faster than existing molecular dynamics-based methods. We also convert our coarse-grained machine learning output into an all-atom model using discrete molecular dynamics with constraints. Our proposed computational pipeline predicts all-atom RNA models solely from the nucleotide sequence. However, this method suffers from the same limitation as nucleic acid molecular dynamics: the scarcity of available RNA crystal structures for training.
AB - Fast and accurate 3D RNA structure prediction remains a major challenge in structural biology, mostly due to the size and flexibility of RNA molecules, as well as the lack of diverse experimentally determined structures of RNA molecules. Unlike DNA structure, RNA structure is far less constrained by basepair hydrogen bonding, resulting in an explosion of potential stable states. Here, we propose a convolutional neural network that predicts all pairwise distances between residues in an RNA, using a recently described smooth parametrization of Euclidean distance matrices. We achieve high-accuracy predictions on RNAs up to 100 nt in length in fractions of a second, a factor of 107 faster than existing molecular dynamics-based methods. We also convert our coarse-grained machine learning output into an all-atom model using discrete molecular dynamics with constraints. Our proposed computational pipeline predicts all-atom RNA models solely from the nucleotide sequence. However, this method suffers from the same limitation as nucleic acid molecular dynamics: the scarcity of available RNA crystal structures for training.
UR - http://www.scopus.com/inward/record.url?scp=85175293015&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85175293015&partnerID=8YFLogxK
U2 - 10.1016/j.bpj.2023.10.011
DO - 10.1016/j.bpj.2023.10.011
M3 - Article
C2 - 37838833
AN - SCOPUS:85175293015
SN - 0006-3495
VL - 123
SP - 2671
EP - 2681
JO - Biophysical journal
JF - Biophysical journal
IS - 17
ER -