TY - JOUR
T1 - Specification-Based Autonomous Driving System Testing
AU - Zhou, Yuan
AU - Sun, Yang
AU - Tang, Yun
AU - Chen, Yuqi
AU - Sun, Jun
AU - Poskitt, Christopher M.
AU - Liu, Yang
AU - Yang, Zijiang
N1 - Publisher Copyright:
© 1976-2012 IEEE.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Autonomous vehicle (AV) systems must be comprehensively tested and evaluated before they can be deployed. High-fidelity simulators such as CARLA or LGSVL allow this to be done safely in very realistic and highly customizable environments. Existing testing approaches, however, fail to test simulated AVs systematically, as they focus on specific scenarios and oracles (e.g., lane following scenario with the 'no collision' requirement) and lack any coverage criteria measures. In this paper, we propose AVUnit a framework for systematically testing AV systems against customizable correctness specifications. Designed modularly to support different simulators, AVUnit consists of two new languages for specifying dynamic properties of scenes (e.g., changing pedestrian behaviour after waypoints) and fine-grained assertions about the AV's journey. AVUnit further supports multiple fuzzing algorithms that automatically search for test cases that violate these assertions, using robustness and coverage measures as fitness metrics. We evaluated the implementation of AVUnit for the LGSVL+Apollo simulation environment, finding 19 kinds of issues in Apollo, which indicate that the open-source Apollo does not perform well in complex intersections and lane-changing related scenarios.
AB - Autonomous vehicle (AV) systems must be comprehensively tested and evaluated before they can be deployed. High-fidelity simulators such as CARLA or LGSVL allow this to be done safely in very realistic and highly customizable environments. Existing testing approaches, however, fail to test simulated AVs systematically, as they focus on specific scenarios and oracles (e.g., lane following scenario with the 'no collision' requirement) and lack any coverage criteria measures. In this paper, we propose AVUnit a framework for systematically testing AV systems against customizable correctness specifications. Designed modularly to support different simulators, AVUnit consists of two new languages for specifying dynamic properties of scenes (e.g., changing pedestrian behaviour after waypoints) and fine-grained assertions about the AV's journey. AVUnit further supports multiple fuzzing algorithms that automatically search for test cases that violate these assertions, using robustness and coverage measures as fitness metrics. We evaluated the implementation of AVUnit for the LGSVL+Apollo simulation environment, finding 19 kinds of issues in Apollo, which indicate that the open-source Apollo does not perform well in complex intersections and lane-changing related scenarios.
KW - Autonomous driving system
KW - coverage criteria
KW - fuzzing
KW - specification languages
KW - testing
UR - https://www.scopus.com/pages/publications/85149856139
U2 - 10.1109/TSE.2023.3254142
DO - 10.1109/TSE.2023.3254142
M3 - 文章
AN - SCOPUS:85149856139
SN - 0098-5589
VL - 49
SP - 3391
EP - 3410
JO - IEEE Transactions on Software Engineering
JF - IEEE Transactions on Software Engineering
IS - 6
ER -