TY - GEN
T1 - 3D-MBNet
T2 - 25th IEEE International Conference on Intelligent Transportation Systems, ITSC 2022
AU - Wang, Shengqi
AU - Huang, Yuhao
AU - Kang, Miao
AU - Chen, Badong
AU - Zheng, Nanning
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Predicting vehicle trajectories in traffic scenes is an important issue for autonomous driving and advanced driver assistance systems. The spatial and temporal interactions between the predicted vehicle and its surrounding vehicles are commonly used for trajectory prediction. However, most existing methods usually deal with the two interactions separately. In this paper, we propose a multimodal vehicle trajectory prediction model, which includes a 3D social convolution module to jointly model spatial and temporal interactions. Furthermore, to make interpretable multimodal predictions, we define several types of driving intentions (namely lane changes, acceleration, deceleration and driving with uniform velocity) and introduce multi-branch decoders to decode different intentions. The two-stage training strategy is introduced to guarantee the recall rates of the intentions with long-tailed distributions. Our model outperforms several state-of-the-art methods on the NGSIM and highD benchmark datasets. In addition, the ablation experiments show that our method can improve the prediction accuracy of each specified modality.
AB - Predicting vehicle trajectories in traffic scenes is an important issue for autonomous driving and advanced driver assistance systems. The spatial and temporal interactions between the predicted vehicle and its surrounding vehicles are commonly used for trajectory prediction. However, most existing methods usually deal with the two interactions separately. In this paper, we propose a multimodal vehicle trajectory prediction model, which includes a 3D social convolution module to jointly model spatial and temporal interactions. Furthermore, to make interpretable multimodal predictions, we define several types of driving intentions (namely lane changes, acceleration, deceleration and driving with uniform velocity) and introduce multi-branch decoders to decode different intentions. The two-stage training strategy is introduced to guarantee the recall rates of the intentions with long-tailed distributions. Our model outperforms several state-of-the-art methods on the NGSIM and highD benchmark datasets. In addition, the ablation experiments show that our method can improve the prediction accuracy of each specified modality.
UR - https://www.scopus.com/pages/publications/85141832486
U2 - 10.1109/ITSC55140.2022.9921812
DO - 10.1109/ITSC55140.2022.9921812
M3 - 会议稿件
AN - SCOPUS:85141832486
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 880
EP - 887
BT - 2022 IEEE 25th International Conference on Intelligent Transportation Systems, ITSC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 8 October 2022 through 12 October 2022
ER -