TY - GEN
T1 - Efficient Backbone Architecture Search for Stereo Depth Estimation in Autonomous Driving
AU - Zhang, Xuchong
AU - Dai, He
AU - Chen, Jianing
AU - Sun, Hongbin
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Recent advances in AutoML have extended Neural Architecture Search (NAS) beyond image classification to optimize dense prediction tasks. However, the existing works are inappropriate to search efficient backbone for deep learning based stereo matching, because their search spaces are not custom-designed according to the inherent requirements of the pixel-wise depth prediction. This paper proposes a differentiable architecture search specific for efficient stereo network backbone. In particular, the proposed method jointly optimizes the micro-architecture and the macro-architecture to search distinct cell structures and adaptive low-level features for stereo network backbone. The target architecture can be found within 3 GPU days using gradient-based optimization. The evaluation results on stereo datasets demonstrate that, by simply replacing the hand-crafted feature extraction with the searched backbone in a vanilla framework, the proposed network obtains much better disparity accuracy than the designs using existing NAS methods, and even achieves comparable performance compared with the state-of-the-art stereo networks that integrate various elaborate modules. Hence, the proposed NAS method is an efficient way to automate the stereo network architecture engineering.
AB - Recent advances in AutoML have extended Neural Architecture Search (NAS) beyond image classification to optimize dense prediction tasks. However, the existing works are inappropriate to search efficient backbone for deep learning based stereo matching, because their search spaces are not custom-designed according to the inherent requirements of the pixel-wise depth prediction. This paper proposes a differentiable architecture search specific for efficient stereo network backbone. In particular, the proposed method jointly optimizes the micro-architecture and the macro-architecture to search distinct cell structures and adaptive low-level features for stereo network backbone. The target architecture can be found within 3 GPU days using gradient-based optimization. The evaluation results on stereo datasets demonstrate that, by simply replacing the hand-crafted feature extraction with the searched backbone in a vanilla framework, the proposed network obtains much better disparity accuracy than the designs using existing NAS methods, and even achieves comparable performance compared with the state-of-the-art stereo networks that integrate various elaborate modules. Hence, the proposed NAS method is an efficient way to automate the stereo network architecture engineering.
UR - https://www.scopus.com/pages/publications/85141873787
U2 - 10.1109/ITSC55140.2022.9922562
DO - 10.1109/ITSC55140.2022.9922562
M3 - 会议稿件
AN - SCOPUS:85141873787
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 355
EP - 362
BT - 2022 IEEE 25th International Conference on Intelligent Transportation Systems, ITSC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Intelligent Transportation Systems, ITSC 2022
Y2 - 8 October 2022 through 12 October 2022
ER -