MBDF-Net: Multi-Branch Deep Fusion Network for 3D Object Detection

  • Xun Tan
  • , Xingyu Chen
  • , Guowei Zhang
  • , Jishiyu Ding
  • , Xuguang Lan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

Point clouds and images could provide complementary information when representing 3D objects. Fusing the two kinds of data usually helps to improve the detection results. However, it is challenging to fuse the two data modalities, due to their different characteristics and the interference from the non-interest areas. To solve this problem, we propose a Multi-Branch Deep Fusion Network (MBDF-Net) for 3D object detection. The proposed detector has two stages. In the first stage, our multi-branch feature extraction network utilizes Adaptive Attention Fusion (AAF) modules to produce cross-modal fusion features from single-modal semantic features. In the second stage, we use a region of interest (RoI) -pooled fusion module to generate enhanced local features for refinement. A novel attention-based hybrid sampling strategy is also proposed for selecting key points in the downsampling process. We evaluate our approach on two widely used benchmark datasets including KITTI and SUN-RGBD. The experimental results demonstrate the advantages of our method over state-of-the-art approaches.

Original languageEnglish
Title of host publicationUrbanMM 2021 - Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, co-located with ACM MM 2021
PublisherAssociation for Computing Machinery, Inc
Pages9-17
Number of pages9
ISBN (Electronic)9781450386692
DOIs
StatePublished - 22 Oct 2021
Event1st International Workshop on Multimedia Computing for Urban Data, UrbanMM 2021, co-located with ACM Multimedia 2021 - Virtual, Online, China
Duration: 20 Oct 2021 → …

Publication series

NameUrbanMM 2021 - Proceedings of the 1st International Workshop on Multimedia Computing for Urban Data, co-located with ACM MM 2021

Conference

Conference1st International Workshop on Multimedia Computing for Urban Data, UrbanMM 2021, co-located with ACM Multimedia 2021
Country/TerritoryChina
CityVirtual, Online
Period20/10/21 → …

Keywords

  • 3d object detection
  • multi-modal fusion
  • point cloud downsampling

Fingerprint

Dive into the research topics of 'MBDF-Net: Multi-Branch Deep Fusion Network for 3D Object Detection'. Together they form a unique fingerprint.

Cite this