Spatial Temporal Graph Deconvolutional Network for Skeleton-Based Human Action Recognition

Research output: Contribution to journalArticlepeer-review

67 Scopus citations

Abstract

Benefited from the powerful ability of spatial temporal Graph Convolutional Networks (ST-GCNs), skeleton-based human action recognition has gained promising success. However, the node interaction through message propagation does not always provide complementary information. Instead, it May even produce destructive noise and thus make learned representations indistinguishable. Inevitably, the graph representation would also become over-smoothing especially when multiple GCN layers are stacked. This paper proposes spatial-temporal graph deconvolutional networks (ST-GDNs), a novel and flexible graph deconvolution technique, to alleviate this issue. At its core, this method provides a better message aggregation by removing the embedding redundancy of the input graphs from either node-wise, frame-wise or element-wise at different network layers. Extensive experiments on three current most challenging benchmarks verify that ST-GDN consistently improves the performance and largely reduce the model size on these datasets.

Original languageEnglish
Article number9314910
Pages (from-to)244-248
Number of pages5
JournalIEEE Signal Processing Letters
Volume28
DOIs
StatePublished - 2021

Keywords

  • Graph neural network
  • over-smoothing
  • skeleton-based action recognition

Fingerprint

Dive into the research topics of 'Spatial Temporal Graph Deconvolutional Network for Skeleton-Based Human Action Recognition'. Together they form a unique fingerprint.

Cite this