TY - JOUR
T1 - Efficient data acquisition and training of collisional-radiative model artificial neural network surrogates through adaptive parameter space sampling
AU - Garland, Nathan A.
AU - Maulik, Romit
AU - Tang, Qi
AU - Tang, Xian Zhu
AU - Balaprakash, Prasanna
N1 - Publisher Copyright:
© 2022 The Author(s). Published by IOP Publishing Ltd.
PY - 2022/12/1
Y1 - 2022/12/1
N2 - Effective plasma transport modeling of magnetically confined fusion devices relies on having an accurate understanding of the ion composition and radiative power losses of the plasma. Generally, these quantities can be obtained from solutions of a collisional-radiative (CR) model at each time step within a plasma transport simulation. However, even compact, approximate CR models can be computationally onerous to evaluate, and in-situ evaluation of these models within a larger plasma transport code can lead to a rigid bottleneck. As a way to bypass this bottleneck, we propose deploying artificial neural network (ANN) surrogates to allow rapid evaluation of the necessary plasma quantities. However, one issue with training an accurate ANN surrogate is the reliance on a sufficiently large and representative training and validation data set, which can be time-consuming to generate. In this work we explore a data-driven active learning and training routine to allow autonomous adaptive sampling of the problem parameter space to ensure a sufficiently large and meaningful set of training data is assembled for the network training. As a result, we can demonstrate approximately order-of-magnitude savings in required training data samples to produce an accurate surrogate.
AB - Effective plasma transport modeling of magnetically confined fusion devices relies on having an accurate understanding of the ion composition and radiative power losses of the plasma. Generally, these quantities can be obtained from solutions of a collisional-radiative (CR) model at each time step within a plasma transport simulation. However, even compact, approximate CR models can be computationally onerous to evaluate, and in-situ evaluation of these models within a larger plasma transport code can lead to a rigid bottleneck. As a way to bypass this bottleneck, we propose deploying artificial neural network (ANN) surrogates to allow rapid evaluation of the necessary plasma quantities. However, one issue with training an accurate ANN surrogate is the reliance on a sufficiently large and representative training and validation data set, which can be time-consuming to generate. In this work we explore a data-driven active learning and training routine to allow autonomous adaptive sampling of the problem parameter space to ensure a sufficiently large and meaningful set of training data is assembled for the network training. As a result, we can demonstrate approximately order-of-magnitude savings in required training data samples to produce an accurate surrogate.
UR - https://www.scopus.com/pages/publications/85140139241
UR - https://www.scopus.com/pages/publications/85140139241#tab=citedBy
U2 - 10.1088/2632-2153/ac93e7
DO - 10.1088/2632-2153/ac93e7
M3 - Article
AN - SCOPUS:85140139241
SN - 2632-2153
VL - 3
JO - Machine Learning: Science and Technology
JF - Machine Learning: Science and Technology
IS - 4
M1 - 045003
ER -