Abstract
Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things (IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement, specified using Signal Temporal Logic (STL), under possible deviations in the system. Based on this definition, we propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement. We present a novel, two-layer simulation-based analysis framework and a novel search heuristic for finding small tolerance violations. To evaluate our approach, we construct a set of benchmark problems where system parameters can be configured to represent different types of uncertainties and disturbances in the system. Our evaluation shows that our falsification approach and heuristic can effectively find small tolerance violations.
| Original language | English (US) |
|---|---|
| Title of host publication | Formal Methods - 26th International Symposium, FM 2024, Proceedings |
| Editors | Andre Platzer, Kristin Yvonne Rozier, Matteo Pradella, Matteo Rossi |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 267-285 |
| Number of pages | 19 |
| ISBN (Print) | 9783031711763 |
| DOIs | |
| State | Published - 2025 |
| Event | 26th International Symposium on Formal Methods, FM 2024 - Milan, Italy Duration: Sep 9 2024 → Sep 13 2024 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 14934 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 26th International Symposium on Formal Methods, FM 2024 |
|---|---|
| Country/Territory | Italy |
| City | Milan |
| Period | 9/9/24 → 9/13/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- General Computer Science
Fingerprint
Dive into the research topics of 'Tolerance of Reinforcement Learning Controllers Against Deviations in Cyber Physical Systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver