跳到主要导航 跳到搜索 跳到主要内容

RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient

  • Xi'an Jiaotong University
  • Huawei Technologies Co., Ltd.

科研成果: 期刊稿件文章同行评审

摘要

Keypoint detection and descriptor matching are two vital steps in the 3D feature extraction framework, but they are difficult to learn in an end-to-end fashion due to their inherent discreteness. To tackle the non-differentiable operations, we formulate feature extraction as a decision-making problem: the network is treated as a policy pool that can make probabilistic estimations for keypoint selection and feature matching, supervised by maximizing a reward expectation of actions. In this way, we propose a novel end-to-end training paradigm of 3D feature extraction based on the stochastic policy gradient method, named Reinforced Detectors and Descriptors (RDD). Firstly, we propose a local-to-global probabilistic keypoint selection module that formulates the sampling probabilities of keypoints in a local-and-global mechanism to yield sparse and accurate keypoints. Secondly, we regard feature matching as an optimal transport problem and an efficient Sinkhorn method is leveraged to solve the optimal matching probabilities. In particular, we carefully design a reward function and derive gradients of probabilistic actions, thus overcoming the discreteness and providing reinforced supervision signals. Since our reward function is calculated from sampled keypoints rather than from randomly sampled points as in existing methods, the gap between training and inference is bridged. Experimental results demonstrate that our approach exceeds the quality of state-of-the-art methods and shows strong generalization ability. Remarkably, our approach can achieve significantly higher Registration Recall than other advanced methods when aligning scenes with a small number of keypoints, due to our highly accurate and repeatable detector.

源语言英语
页(从-至)900-913
页数14
期刊IEEE Transactions on Multimedia
27
DOI
出版状态已出版 - 2025

学术指纹

探究 'RDD: Learning Reinforced 3D Detectors and Descriptors Based on Policy Gradient' 的科研主题。它们共同构成独一无二的指纹。

引用此