TY - JOUR
T1 - When, Where and How Does it Fail? A Spatial-Temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving
AU - Wang, Junhong
AU - Li, Yun
AU - Zhou, Zhaoyu
AU - Wang, Chengshun
AU - Hou, Yijie
AU - Zhang, Li
AU - Xue, Xiangyang
AU - Kamp, Michael
AU - Zhang, Xiaolong Luke
AU - Chen, Siming
N1 - Publisher Copyright:
© 1995-2012 IEEE.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Arguably the most representative application of artificial intelligence, autonomous driving systems usually rely on computer vision techniques to detect the situations of the external environment. Object detection underpins the ability of scene understanding in such systems. However, existing object detection algorithms often behave as a black box, so when a model fails, no information is available on When, Where and How the failure happened. In this paper, we propose a visual analytics approach to help model developers interpret the model failures. The system includes the micro- and macro-interpreting modules to address the interpretability problem of object detection in autonomous driving. The micro-interpreting module extracts and visualizes the features of a convolutional neural network (CNN) algorithm with density maps, while the macro-interpreting module provides spatial-temporal information of an autonomous driving vehicle and its environment. With the situation awareness of the spatial, temporal and neural network information, our system facilitates the understanding of the results of object detection algorithms, and helps the model developers better understand, tune and develop the models. We use real-world autonomous driving data to perform case studies by involving domain experts in computer vision and autonomous driving to evaluate our system. The results from our interviews with them show the effectiveness of our approach.
AB - Arguably the most representative application of artificial intelligence, autonomous driving systems usually rely on computer vision techniques to detect the situations of the external environment. Object detection underpins the ability of scene understanding in such systems. However, existing object detection algorithms often behave as a black box, so when a model fails, no information is available on When, Where and How the failure happened. In this paper, we propose a visual analytics approach to help model developers interpret the model failures. The system includes the micro- and macro-interpreting modules to address the interpretability problem of object detection in autonomous driving. The micro-interpreting module extracts and visualizes the features of a convolutional neural network (CNN) algorithm with density maps, while the macro-interpreting module provides spatial-temporal information of an autonomous driving vehicle and its environment. With the situation awareness of the spatial, temporal and neural network information, our system facilitates the understanding of the results of object detection algorithms, and helps the model developers better understand, tune and develop the models. We use real-world autonomous driving data to perform case studies by involving domain experts in computer vision and autonomous driving to evaluate our system. The results from our interviews with them show the effectiveness of our approach.
UR - http://www.scopus.com/inward/record.url?scp=85137873200&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137873200&partnerID=8YFLogxK
U2 - 10.1109/TVCG.2022.3201101
DO - 10.1109/TVCG.2022.3201101
M3 - Article
C2 - 36040948
AN - SCOPUS:85137873200
SN - 1077-2626
VL - 29
SP - 5033
EP - 5049
JO - IEEE Transactions on Visualization and Computer Graphics
JF - IEEE Transactions on Visualization and Computer Graphics
IS - 12
ER -