跳到主要导航 跳到搜索 跳到主要内容

Towards Accurate Semi-Supervised BEV 3D Object Detection with Depth-Aware Refinement and Denoising-Aided Alignment

  • Zhao Yang
  • , Yinan Shi
  • , Jiangtong Zhu
  • , Weixiang Xu
  • , Longjun Liu
  • Xi'an Jiaotong University
  • Technical University of Munich
  • CAS - Institute of Automation
  • University of Chinese Academy of Sciences

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, camera-based Bird's-Eye View (BEV) representation has gained significant traction in 3D object detection. However, training high-performance BEV 3D detectors typically requires a large number of annotated samples, which can be costly. Traditional semi-supervised methods for BEV 3D object detection face challenges including loss of rich depth information, inconsistent object representations across spaces, and unreliable pseudo label generation, leading to decreased accuracy and performance. Addressing this challenge, we pioneer the introduction of a semi-supervised BEV 3D object detection framework. Our approach leverages a small set of labeled data alongside a larger set of unlabeled data, significantly reducing annotation costs while maintaining robust detection performance. Firstly, we propose a depth-based self-refinement module to generate high-quality and stable pseudo labels, which can effectively regulate training with noisy labels. Secondly, we designed a denoising labels regression module that integrates denoising for both labeled and unlabeled data. Thirdly, in order to alleviate object inconsistency, we propose a consistent object-guided alignment method to ensure the consistency of objects in multi-spaces. Finally, our method can be easily plugged into various BEV 3D detection networks. Extensive experiments show that the proposed method achieves a new state-of-the-art compared to various camera-based 3D detectors tested on multiple public autonomous driving datasets.

源语言英语
主期刊名2025 IEEE International Conference on Robotics and Automation, ICRA 2025
编辑Christian Ott, Henny Admoni, Sven Behnke, Stjepan Bogdan, Aude Bolopion, Youngjin Choi, Fanny Ficuciello, Nicholas Gans, Clement Gosselin, Kensuke Harada, Erdal Kayacan, H. Jin Kim, Stefan Leutenegger, Zhe Liu, Perla Maiolino, Lino Marques, Takamitsu Matsubara, Anastasia Mavromatti, Mark Minor, Jason O'Kane, Hae Won Park, Hae-Won Park, Ioannis Rekleitis, Federico Renda, Elisa Ricci, Laurel D. Riek, Lorenzo Sabattini, Shaojie Shen, Yu Sun, Pierre-Brice Wieber, Katsu Yamane, Jingjin Yu
出版商Institute of Electrical and Electronics Engineers Inc.
15740-15746
页数7
ISBN(电子版)9798331541392
DOI
出版状态已出版 - 2025
活动2025 IEEE International Conference on Robotics and Automation, ICRA 2025 - Atlanta, 美国
期限: 19 5月 202523 5月 2025

出版系列

姓名Proceedings - IEEE International Conference on Robotics and Automation
ISSN(印刷版)1050-4729

会议

会议2025 IEEE International Conference on Robotics and Automation, ICRA 2025
国家/地区美国
Atlanta
时期19/05/2523/05/25

学术指纹

探究 'Towards Accurate Semi-Supervised BEV 3D Object Detection with Depth-Aware Refinement and Denoising-Aided Alignment' 的科研主题。它们共同构成独一无二的指纹。

引用此