TY - GEN
T1 - Deep reinforcement learning-based life-cycle management of deteriorating transportation systems
AU - Saifullah, M.
AU - Andriotis, C. P.
AU - Papakonstantinou, K. G.
AU - Stoffels, S. M.
N1 - Publisher Copyright:
© 2022 the Author(s).
PY - 2023
Y1 - 2023
N2 - Efficient life-cycle bridge asset management delineates a planning optimization problem of paramount importance for the operational reliability of transportation infrastructure. It necessitates adept inspection and maintenance policies able to reduce risks and costs while incorporating long-term stochastic deterioration models, inference under uncertain structural health data, and various probabilistic and deterministic constraints. Structural integrity management policies for individual bridges, which are mere constituents of broader complex networks, cannot be devised in isolation of the policies of other system components, such as other bridges and pavement sections, and without considering system functions and traffic considerations. Such network effects render the optimization problem even harder to solve. Currently, age- or condition-based maintenance techniques, as well as risk-based or periodic inspection plans, have been used to address this class of challenging optimization problems. However, the efficacy of these techniques is often limited by optimality-, scalability-, and uncertainty-induced complexities. In practice, infrastructure management agencies often treat interconnected systems using disjoint plans for different component types, which in general do not ensure system-level optimality. To tackle the above, the optimization problem is herein cast within constrained Partially Observable Markov Decision Processes (POMDPs), which provide a comprehensive mathematical framework for stochastic sequential decision settings under observation/monitoring data uncertainty and limited resources. For the problem solution, the DDMAC algorithm (Deep Decentralized Multi-agent Actor-Critic) is successfully used, a deep reinforcement learning algorithm well-suited for management of large multi-state multi-component systems, as illustrated in an example application of an existing transportation network in Virginia, USA. The studied network comprises several bridge and pavement components exhibiting nonstationary deterioration, and various agency-imposed constraints, and traffic delay and risk factors are considered. Comparisons against conventional management policies showcase that the DDMAC solution significantly outperforms its counterparts.
AB - Efficient life-cycle bridge asset management delineates a planning optimization problem of paramount importance for the operational reliability of transportation infrastructure. It necessitates adept inspection and maintenance policies able to reduce risks and costs while incorporating long-term stochastic deterioration models, inference under uncertain structural health data, and various probabilistic and deterministic constraints. Structural integrity management policies for individual bridges, which are mere constituents of broader complex networks, cannot be devised in isolation of the policies of other system components, such as other bridges and pavement sections, and without considering system functions and traffic considerations. Such network effects render the optimization problem even harder to solve. Currently, age- or condition-based maintenance techniques, as well as risk-based or periodic inspection plans, have been used to address this class of challenging optimization problems. However, the efficacy of these techniques is often limited by optimality-, scalability-, and uncertainty-induced complexities. In practice, infrastructure management agencies often treat interconnected systems using disjoint plans for different component types, which in general do not ensure system-level optimality. To tackle the above, the optimization problem is herein cast within constrained Partially Observable Markov Decision Processes (POMDPs), which provide a comprehensive mathematical framework for stochastic sequential decision settings under observation/monitoring data uncertainty and limited resources. For the problem solution, the DDMAC algorithm (Deep Decentralized Multi-agent Actor-Critic) is successfully used, a deep reinforcement learning algorithm well-suited for management of large multi-state multi-component systems, as illustrated in an example application of an existing transportation network in Virginia, USA. The studied network comprises several bridge and pavement components exhibiting nonstationary deterioration, and various agency-imposed constraints, and traffic delay and risk factors are considered. Comparisons against conventional management policies showcase that the DDMAC solution significantly outperforms its counterparts.
UR - http://www.scopus.com/inward/record.url?scp=85150411322&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85150411322&partnerID=8YFLogxK
U2 - 10.1201/9781003322641-32
DO - 10.1201/9781003322641-32
M3 - Conference contribution
AN - SCOPUS:85150411322
SN - 9781032345314
T3 - Bridge Safety, Maintenance, Management, Life-Cycle, Resilience and Sustainability - Proceedings of the 11th International Conference on Bridge Maintenance, Safety and Management, IABMAS 2022
SP - 293
EP - 301
BT - Bridge Safety, Maintenance, Management, Life-Cycle, Resilience and Sustainability - Proceedings of the 11th International Conference on Bridge Maintenance, Safety and Management, IABMAS 2022
A2 - Casas, Joan-Ramon
A2 - Frangopol, Dan M.
A2 - Turmo, Jose
PB - CRC Press/Balkema
T2 - 11th International Conference on Bridge Maintenance, Safety and Management, IABMAS 2022
Y2 - 11 July 2022 through 15 July 2022
ER -