TY - GEN
T1 - Tolerance of Reinforcement Learning Controllers Against Deviations in Cyber Physical Systems
AU - Zhang, Changjian
AU - Kapoor, Parv
AU - Meira-Góes, Rômulo
AU - Garlan, David
AU - Kang, Eunsuk
AU - Ganlath, Akila
AU - Mishra, Shatadal
AU - Ammar, Nejib
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025
Y1 - 2025
N2 - Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things (IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement, specified using Signal Temporal Logic (STL), under possible deviations in the system. Based on this definition, we propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement. We present a novel, two-layer simulation-based analysis framework and a novel search heuristic for finding small tolerance violations. To evaluate our approach, we construct a set of benchmark problems where system parameters can be configured to represent different types of uncertainties and disturbances in the system. Our evaluation shows that our falsification approach and heuristic can effectively find small tolerance violations.
AB - Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things (IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement, specified using Signal Temporal Logic (STL), under possible deviations in the system. Based on this definition, we propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement. We present a novel, two-layer simulation-based analysis framework and a novel search heuristic for finding small tolerance violations. To evaluate our approach, we construct a set of benchmark problems where system parameters can be configured to represent different types of uncertainties and disturbances in the system. Our evaluation shows that our falsification approach and heuristic can effectively find small tolerance violations.
UR - http://www.scopus.com/inward/record.url?scp=85205088722&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85205088722&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-71177-0_17
DO - 10.1007/978-3-031-71177-0_17
M3 - Conference contribution
AN - SCOPUS:85205088722
SN - 9783031711763
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 267
EP - 285
BT - Formal Methods - 26th International Symposium, FM 2024, Proceedings
A2 - Platzer, Andre
A2 - Rozier, Kristin Yvonne
A2 - Pradella, Matteo
A2 - Rossi, Matteo
PB - Springer Science and Business Media Deutschland GmbH
T2 - 26th International Symposium on Formal Methods, FM 2024
Y2 - 9 September 2024 through 13 September 2024
ER -