跳到主要导航 跳到搜索 跳到主要内容

Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition

  • Xi'an Jiaotong University

科研成果: 期刊稿件文章同行评审

39 引用 (Scopus)

摘要

3D human action recognition has received increasing attention due to its potential application in video surveillance equipment. To guarantee satisfactory performance, previous studies are mainly based on supervised methods, which have to add a large amount of manual annotation costs. In addition, general deep networks for video sequences suffer from heavy computational costs, thus cannot satisfy the basic requirement of embedded systems. In this paper, a novel Motion Guided Attention Learning (MG-AL) framework is proposed, which formulates the action representation learning as a self-supervised motion attention prediction problem. Specifically, MG-AL is a lightweight network. A set of simple motion priors (e.g., intra-joint variance, inter-frame deviation, intra-joint variance, and cross-joint covariance), which minimizes additional parameters and computational overhead, is regarded as a supervisory signal to guide the attention generation. The encoder is trained via predicting multiple self-attention tasks to capture action-specific feature representations. Extensive evaluations are performed on three challenging benchmark datasets (NTU-RGB+D 60, NTU-RGB+D 120 and NW-UCLA). The proposed method achieves superior performance compared to state-of-the-art methods, while having a very low computational cost.

源语言英语
页(从-至)8623-8634
页数12
期刊IEEE Transactions on Circuits and Systems for Video Technology
32
12
DOI
出版状态已出版 - 1 12月 2022

学术指纹

探究 'Motion Guided Attention Learning for Self-Supervised 3D Human Action Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此