TY - GEN
T1 - Data-driven anytime algorithms for motion planning with safety guarantees
AU - Jha, Devesh K.
AU - Zhu, Minghui
AU - Wang, Yebin
AU - Ray, Asok
N1 - Publisher Copyright:
© 2016 American Automatic Control Council (AACC).
PY - 2016/7/28
Y1 - 2016/7/28
N2 - This paper presents a learning-based (i.e., data-driven) approach to motion planning of robotic systems. This is motivated by controller synthesis problems for safety critical systems where an accurate estimate of the uncertainties (e.g., unmodeled dynamics, disturbance) can improve the performance of the system. The state-space of the system is built by sampling from the state-set as well as the input set of the underlying system. The robust adaptive motion planning problem is modeled as a learning-based approach evasion differential game, where a machine-learning algorithm is used to update the statistical estimates of the uncertainties from system observations. The system begins with a conservative estimate of the uncertainty set to ensure safety of the underlying system and we relax the robustness constraints as we get better estimates of the unmodeled uncertainty. The estimates from the machine learning algorithm are used to refine the estimates of the controller in an anytime fashion. We show that the values for the game converges to the optimal values with known disturbance given the statistical estimates on the uncertainty converges. Using confidence intervals for the unmodeled disturbance estimated by the machine learning estimator during the transient learning phase, we are able to guarantee safety of the robotic system with the proposed algorithms during transience.
AB - This paper presents a learning-based (i.e., data-driven) approach to motion planning of robotic systems. This is motivated by controller synthesis problems for safety critical systems where an accurate estimate of the uncertainties (e.g., unmodeled dynamics, disturbance) can improve the performance of the system. The state-space of the system is built by sampling from the state-set as well as the input set of the underlying system. The robust adaptive motion planning problem is modeled as a learning-based approach evasion differential game, where a machine-learning algorithm is used to update the statistical estimates of the uncertainties from system observations. The system begins with a conservative estimate of the uncertainty set to ensure safety of the underlying system and we relax the robustness constraints as we get better estimates of the unmodeled uncertainty. The estimates from the machine learning algorithm are used to refine the estimates of the controller in an anytime fashion. We show that the values for the game converges to the optimal values with known disturbance given the statistical estimates on the uncertainty converges. Using confidence intervals for the unmodeled disturbance estimated by the machine learning estimator during the transient learning phase, we are able to guarantee safety of the robotic system with the proposed algorithms during transience.
UR - http://www.scopus.com/inward/record.url?scp=84992053062&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84992053062&partnerID=8YFLogxK
U2 - 10.1109/ACC.2016.7526565
DO - 10.1109/ACC.2016.7526565
M3 - Conference contribution
AN - SCOPUS:84992053062
T3 - Proceedings of the American Control Conference
SP - 5716
EP - 5721
BT - 2016 American Control Conference, ACC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 American Control Conference, ACC 2016
Y2 - 6 July 2016 through 8 July 2016
ER -