跳到主要导航 跳到搜索 跳到主要内容

Point-RMAE: Reinforcement Masked Autoencoder for 3D Representation Learning

  • Haozhe Cheng
  • , Lintong Wei
  • , Wenjing Wang
  • , Wenbiao Yan
  • , Jinqian Chen
  • , Jian Lu
  • , Kun Yue
  • , Jihua Zhu
  • Xi'an Jiaotong University
  • Xi'an Polytechnic University
  • Yunnan University

科研成果: 期刊稿件文章同行评审

摘要

The Mainstream 3D masked point modeling representation learning community typically employs predefined, fixed-ratio random or block masking strategies, aiming to obtain optimal representations and achieve high downstream performance. However, these empirical designs overlook the significant geometric information and structural importance differences that are inherent among different 3D points, leading to a suboptimal trade-off between the representation capture capabilities and reconstruction difficulty of such masking strategies. To address this issue, we are the first to present this decision-making problem to a reinforcement learning agent and propose a Reinforcement Masked Autoencoder for 3D representation learning, named Point-RMAE. Guided by geometric features as state factor, this method leverages the Masking Strategy Analyzer and the Dynamic Masking Generator to adaptively decide and apply the masking strategy during pretraining. The Masking Ratio Scheduling module dynamically adjusts the masking ratio based on the optimal strategy. Subsequently, the analyzer is updated by multiscale rewards derived from reconstruction quality level, distribution-aware feedback, and policy exploration. Notably, to enrich the Reward Function with distribution-aware signals and avoid decision collapse issue, we propose a Flow Matching Point Cloud Fast Generator that guides the selected masking decisions. Our method achieves outstanding performance across downstream tasks such as shape classification, medical diagnosis, object detection, action recognition, denoising and multiscale scene segmentation on ten popular 3D and 4D datasets. More importantly, Point-RMAE pioneers the application of reinforcement learning in 3D self-supervised representation learning.

源语言英语
期刊IEEE Transactions on Image Processing
DOI
出版状态已接受/待刊 - 2026
已对外发布

学术指纹

探究 'Point-RMAE: Reinforcement Masked Autoencoder for 3D Representation Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此