TY - GEN
T1 - Deep reinforcement learning for transfer of control policies
AU - Cunningham, James D.
AU - Miller, Simon W.
AU - Yukish, Michael A.
AU - Simpson, Timothy W.
AU - Tucker, Conrad S.
N1 - Publisher Copyright:
Copyright © 2019 ASME.
PY - 2019
Y1 - 2019
N2 - We present a form-aware reinforcement learning (RL) method to extend control knowledge from one design form to another, without losing the ability to control the original design. A major challenge in developing control knowledge is the creation of generalized control policies across designs of varying form. Our presented RL policy is form-aware because in addition to receiving dynamic state information about the environment, it also receives states that encode information about the form of the design that is being controlled. In this paper, we investigate the impact of this mixed state space on transfer learning. We present a transfer learning method for extending a control policy to a different design form, while continuing to expose the agent to the original design during the training of the new design. To demonstrate this concept, we present a case study of a multi-rotor aircraft simulation, wherein the designated task is to achieve a stable hover. We show that by introducing form states, an RL agent is able to learn a control policy to achieve the hovering task with both a four rotor and three rotor design at once, whereas without the form states it can only hover with the four rotor design. We also benchmark our method against a test case that removes the transfer learning component, as well as a test case that removes the continued exposure to the original design to show the value of each of these components. We find that form states, transfer learning, and parallel learning all contribute to a more robust control policy for the new design, and that parallel learning is especially important for maintaining control knowledge of the original design.
AB - We present a form-aware reinforcement learning (RL) method to extend control knowledge from one design form to another, without losing the ability to control the original design. A major challenge in developing control knowledge is the creation of generalized control policies across designs of varying form. Our presented RL policy is form-aware because in addition to receiving dynamic state information about the environment, it also receives states that encode information about the form of the design that is being controlled. In this paper, we investigate the impact of this mixed state space on transfer learning. We present a transfer learning method for extending a control policy to a different design form, while continuing to expose the agent to the original design during the training of the new design. To demonstrate this concept, we present a case study of a multi-rotor aircraft simulation, wherein the designated task is to achieve a stable hover. We show that by introducing form states, an RL agent is able to learn a control policy to achieve the hovering task with both a four rotor and three rotor design at once, whereas without the form states it can only hover with the four rotor design. We also benchmark our method against a test case that removes the transfer learning component, as well as a test case that removes the continued exposure to the original design to show the value of each of these components. We find that form states, transfer learning, and parallel learning all contribute to a more robust control policy for the new design, and that parallel learning is especially important for maintaining control knowledge of the original design.
UR - http://www.scopus.com/inward/record.url?scp=85076496572&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076496572&partnerID=8YFLogxK
U2 - 10.1115/DETC2019-97689
DO - 10.1115/DETC2019-97689
M3 - Conference contribution
AN - SCOPUS:85076496572
T3 - Proceedings of the ASME Design Engineering Technical Conference
BT - 45th Design Automation Conference
PB - American Society of Mechanical Engineers (ASME)
T2 - ASME 2019 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC-CIE 2019
Y2 - 18 August 2019 through 21 August 2019
ER -