TY - JOUR
T1 - Unsupervised learning of atomic environments from simple features
AU - Reinhart, Wesley F.
N1 - Funding Information:
I thank Sapna Sarupria and Steven Hall for providing the data used to train their PointNET model and for helpful discussions regarding the results. I also thank Antonia Statt for helpful discussions and proofreading of the manuscript. This work was supported in part by the Institute for Computational and Data Sciences and Materials Research Institute at the Pennsylvania State University.
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/8
Y1 - 2021/8
N2 - I present a strategy for unsupervised manifold learning on local atomic environments in molecular simulations based on simple rotation- and permutation-invariant three-body features. These features are highly descriptive, generalize to multiple chemical species, and are human-interpretable. The low-dimensional embeddings of each atomic environment can be used to understand and quantify messy crystal structures such as those near interfaces and defects or well-ordered crystal lattices such as in bulk materials without modification. The same method can also yield collective variables describing collections of particles such as for an entire simulation domain. I demonstrate the method on colloidal crystallization, ice crystals, and binary mesophases to illustrate its broad applicability. In each case, the learned latent space yields insights into the details of the observed microstructures. For ices and mesophases, supervised classifiers are trained based on the learned manifolds and directly compared against a recent neural-network-based approach. Notably, while this method provides comparable classification performance, it can also be deployed on even a handful of observed environments without labels or a priori knowledge. Thus, the current approach provides an incredibly versatile strategy to characterize and classify local atomic environments, and may unlock insights in a wide variety of molecular simulation contexts.
AB - I present a strategy for unsupervised manifold learning on local atomic environments in molecular simulations based on simple rotation- and permutation-invariant three-body features. These features are highly descriptive, generalize to multiple chemical species, and are human-interpretable. The low-dimensional embeddings of each atomic environment can be used to understand and quantify messy crystal structures such as those near interfaces and defects or well-ordered crystal lattices such as in bulk materials without modification. The same method can also yield collective variables describing collections of particles such as for an entire simulation domain. I demonstrate the method on colloidal crystallization, ice crystals, and binary mesophases to illustrate its broad applicability. In each case, the learned latent space yields insights into the details of the observed microstructures. For ices and mesophases, supervised classifiers are trained based on the learned manifolds and directly compared against a recent neural-network-based approach. Notably, while this method provides comparable classification performance, it can also be deployed on even a handful of observed environments without labels or a priori knowledge. Thus, the current approach provides an incredibly versatile strategy to characterize and classify local atomic environments, and may unlock insights in a wide variety of molecular simulation contexts.
UR - http://www.scopus.com/inward/record.url?scp=85105055578&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85105055578&partnerID=8YFLogxK
U2 - 10.1016/j.commatsci.2021.110511
DO - 10.1016/j.commatsci.2021.110511
M3 - Article
AN - SCOPUS:85105055578
SN - 0927-0256
VL - 196
JO - Computational Materials Science
JF - Computational Materials Science
M1 - 110511
ER -