TY - JOUR
T1 - Surveying the energy landscape of coarse-grained mappings
AU - Kidder, Katherine M.
AU - Shell, M. Scott
AU - Noid, W. G.
N1 - Publisher Copyright:
© 2024 Author(s).
PY - 2024/2/7
Y1 - 2024/2/7
N2 - Simulations of soft materials often adopt low-resolution coarse-grained (CG) models. However, the CG representation is not unique and its impact upon simulated properties is poorly understood. In this work, we investigate the space of CG representations for ubiquitin, which is a typical globular protein with 72 amino acids. We employ Monte Carlo methods to ergodically sample this space and to characterize its landscape. By adopting the Gaussian network model as an analytically tractable atomistic model for equilibrium fluctuations, we exactly assess the intrinsic quality of each CG representation without introducing any approximations in sampling configurations or in modeling interactions. We focus on two metrics, the spectral quality and the information content, that quantify the extent to which the CG representation preserves low-frequency, large-amplitude motions and configurational information, respectively. The spectral quality and information content are weakly correlated among high-resolution representations but become strongly anticorrelated among low-resolution representations. Representations with maximal spectral quality appear consistent with physical intuition, while low-resolution representations with maximal information content do not. Interestingly, quenching studies indicate that the energy landscape of mapping space is very smooth and highly connected. Moreover, our study suggests a critical resolution below which a “phase transition” qualitatively distinguishes good and bad representations.
AB - Simulations of soft materials often adopt low-resolution coarse-grained (CG) models. However, the CG representation is not unique and its impact upon simulated properties is poorly understood. In this work, we investigate the space of CG representations for ubiquitin, which is a typical globular protein with 72 amino acids. We employ Monte Carlo methods to ergodically sample this space and to characterize its landscape. By adopting the Gaussian network model as an analytically tractable atomistic model for equilibrium fluctuations, we exactly assess the intrinsic quality of each CG representation without introducing any approximations in sampling configurations or in modeling interactions. We focus on two metrics, the spectral quality and the information content, that quantify the extent to which the CG representation preserves low-frequency, large-amplitude motions and configurational information, respectively. The spectral quality and information content are weakly correlated among high-resolution representations but become strongly anticorrelated among low-resolution representations. Representations with maximal spectral quality appear consistent with physical intuition, while low-resolution representations with maximal information content do not. Interestingly, quenching studies indicate that the energy landscape of mapping space is very smooth and highly connected. Moreover, our study suggests a critical resolution below which a “phase transition” qualitatively distinguishes good and bad representations.
UR - http://www.scopus.com/inward/record.url?scp=85183978009&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85183978009&partnerID=8YFLogxK
U2 - 10.1063/5.0182524
DO - 10.1063/5.0182524
M3 - Article
C2 - 38310476
AN - SCOPUS:85183978009
SN - 0021-9606
VL - 160
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 5
M1 - 054105
ER -