TY - JOUR
T1 - Multi-fidelity reinforcement learning framework for shape optimization
AU - Bhola, Sahil
AU - Pawar, Suraj
AU - Balaprakash, Prasanna
AU - Maulik, Romit
N1 - Funding Information:
This material is based upon work supported by the U.S. Department of Energy (DOE), Office of Science , Office of Advanced Scientific Computing Research, under Contract DE-AC02-06CH11357 . This research was funded in part and used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357 .
Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Deep reinforcement learning (DRL) is a promising outer-loop intelligence paradigm which can deploy problem solving strategies for complex tasks. Consequently, DRL has been utilized for several scientific applications, specifically in cases where classical optimization or control methods are limited. One key limitation of conventional DRL methods is their episode-hungry nature which proves to be a bottleneck for tasks which involve costly evaluations of a numerical forward model. In this article, we address this limitation of DRL by introducing a controlled transfer learning framework that leverages a multi-fidelity simulation setting. Our strategy is deployed for an airfoil shape optimization problem at high Reynolds numbers, where our framework can learn an optimal policy for generating efficient airfoil shapes by gathering knowledge from multi-fidelity environments and reduces computational costs by over 30%. Furthermore, our formulation promotes policy exploration and generalization to new environments, thereby preventing over-fitting to data from solely one fidelity. Our results demonstrate this framework's applicability to other scientific DRL scenarios where multi-fidelity environments can be used for policy learning.
AB - Deep reinforcement learning (DRL) is a promising outer-loop intelligence paradigm which can deploy problem solving strategies for complex tasks. Consequently, DRL has been utilized for several scientific applications, specifically in cases where classical optimization or control methods are limited. One key limitation of conventional DRL methods is their episode-hungry nature which proves to be a bottleneck for tasks which involve costly evaluations of a numerical forward model. In this article, we address this limitation of DRL by introducing a controlled transfer learning framework that leverages a multi-fidelity simulation setting. Our strategy is deployed for an airfoil shape optimization problem at high Reynolds numbers, where our framework can learn an optimal policy for generating efficient airfoil shapes by gathering knowledge from multi-fidelity environments and reduces computational costs by over 30%. Furthermore, our formulation promotes policy exploration and generalization to new environments, thereby preventing over-fitting to data from solely one fidelity. Our results demonstrate this framework's applicability to other scientific DRL scenarios where multi-fidelity environments can be used for policy learning.
UR - http://www.scopus.com/inward/record.url?scp=85150067313&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85150067313&partnerID=8YFLogxK
U2 - 10.1016/j.jcp.2023.112018
DO - 10.1016/j.jcp.2023.112018
M3 - Article
AN - SCOPUS:85150067313
SN - 0021-9991
VL - 482
JO - Journal of Computational Physics
JF - Journal of Computational Physics
M1 - 112018
ER -