TY - JOUR
T1 - A Reinforcement Learning Method for Multiasset Roadway Improvement Scheduling Considering Traffic Impacts
AU - Zhou, Weiwen
AU - Miller-Hooks, Elise
AU - Papakonstantinou, Kostas G.
AU - Stoffels, Shelley
AU - McNeil, Sue
N1 - Publisher Copyright:
© 2022 American Society of Civil Engineers.
PY - 2022/12/1
Y1 - 2022/12/1
N2 - Maintaining roadway pavements and bridge decks is key to providing high levels of service for road users. However, improvement actions incur downtime. These actions are typically scheduled by asset class, yet implemented on any asset type, they have network-wide impacts on traffic performance. This paper presents a bilevel program wherein the upper level involves a Markov decision process (MDP) through which potential roadway improvement actions across asset classes are prioritized and scheduled. The MDP approach considers uncertainty in component deterioration effects, while incorporating the benefits of implemented improvement actions. The upper level takes as input traffic flow estimates obtained from a lower-level user equilibrium traffic formulation that recognizes changes in capacities determined by decisions taken in the upper level. Because an exact solution of this bilevel, stochastic, dynamic program is formidable, a deep reinforcement learning (DRL) method is developed. The model and solution methodology were tested on a hypothetical problem from the literature. The importance of obtaining optimal activity plans that account for downtime effects, traffic congestion impacts, uncertainty in deterioration processes, and multiasset classes is demonstrated.
AB - Maintaining roadway pavements and bridge decks is key to providing high levels of service for road users. However, improvement actions incur downtime. These actions are typically scheduled by asset class, yet implemented on any asset type, they have network-wide impacts on traffic performance. This paper presents a bilevel program wherein the upper level involves a Markov decision process (MDP) through which potential roadway improvement actions across asset classes are prioritized and scheduled. The MDP approach considers uncertainty in component deterioration effects, while incorporating the benefits of implemented improvement actions. The upper level takes as input traffic flow estimates obtained from a lower-level user equilibrium traffic formulation that recognizes changes in capacities determined by decisions taken in the upper level. Because an exact solution of this bilevel, stochastic, dynamic program is formidable, a deep reinforcement learning (DRL) method is developed. The model and solution methodology were tested on a hypothetical problem from the literature. The importance of obtaining optimal activity plans that account for downtime effects, traffic congestion impacts, uncertainty in deterioration processes, and multiasset classes is demonstrated.
UR - http://www.scopus.com/inward/record.url?scp=85137686221&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137686221&partnerID=8YFLogxK
U2 - 10.1061/(ASCE)IS.1943-555X.0000702
DO - 10.1061/(ASCE)IS.1943-555X.0000702
M3 - Article
AN - SCOPUS:85137686221
SN - 1076-0342
VL - 28
JO - Journal of Infrastructure Systems
JF - Journal of Infrastructure Systems
IS - 4
M1 - 04022033
ER -