跳到主要导航 跳到搜索 跳到主要内容

Multiple anchor learning for visual object detection

  • Wei Ke
  • , Tianliang Zhang
  • , Zeyi Huang
  • , Qixiang Ye
  • , Jianzhuang Liu
  • , Dong Huang
  • Carnegie Mellon University
  • University of Chinese Academy of Sciences
  • Shenzhen Institute of Advanced Technology

科研成果: 期刊稿件会议文章同行评审

103 引用 (Scopus)

摘要

Classification and localization are two pillars of visual object detectors. However, in CNN-based detectors, these two modules are usually optimized under a fixed set of candidate (or anchor) bounding boxes. This configuration significantly limits the possibility to jointly optimize classification and localization. In this paper, we propose a Multiple Instance Learning (MIL) approach that selects anchors and jointly optimizes the two modules of a CNN-based object detector. Our approach, referred to as Multiple Anchor Learning (MAL), constructs anchor bags and selects the most representative anchors from each bag. Such an iterative selection process is potentially NP-hard to optimize. To address this issue, we solve MAL by repetitively depressing the confidence of selected anchors by perturbing their corresponding features. In an adversarial selection-depression manner, MAL not only pursues optimal solutions but also fully leverages multiple anchors/features to learn a detection model. Experiments show that MAL improves the baseline RetinaNet with significant margins on the commonly used MS-COCO object detection benchmark and achieves new state-of-the-art detection performance compared with recent methods.

源语言英语
文章编号9156521
页(从-至)10203-10212
页数10
期刊Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOI
出版状态已出版 - 2020
活动2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020 - Virtual, Online, 美国
期限: 14 6月 202019 6月 2020

学术指纹

探究 'Multiple anchor learning for visual object detection' 的科研主题。它们共同构成独一无二的指纹。

引用此