Physical Knowledge Driven Multi-scale Temporal Receptive Field Network for Compressed Video Action Recognition

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

4 Scopus citations

Abstract

Action recognition on intelligent terminals is important for smart cities. However, because existing image-based methods depend heavily on training data and extract information at high computational cost, they cannot be deployed on such terminals. Moreover, recognizing actions of different durations remains a challenge. To address these issues, we first extend the traditional image domain to the compressed domain, which lets us efficiently extract key-frame information and MVs (motion vectors), a form of physical knowledge that reflects multi-scale temporal features, without complete decoding. Then, to recognize actions of different durations, we propose a multi-scale temporal receptive field network with short-term and long-term branches. It simultaneously captures the action's instantaneous change from the extracted MVs, the long-term temporal feature between adjacent key frames, and the interaction between the two. Results show that our algorithm achieves a better balance between accuracy and computational complexity.
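
The two temporal scales named in the abstract can be illustrated with a toy sketch. This is not the authors' network: the function names, the (dx, dy) motion-vector representation, the key-frame descriptors, and the fusion rule are all hypothetical and chosen only to show how per-frame MV statistics (short-term) and key-frame differences (long-term) might be combined.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: one motion vector as (dx, dy) in pixels.
Vector = Tuple[float, float]


def short_term_feature(mvs_per_frame: Sequence[Sequence[Vector]]) -> List[float]:
    """Mean motion-vector magnitude per predicted frame (instantaneous change)."""
    feats = []
    for mvs in mvs_per_frame:
        if not mvs:
            feats.append(0.0)
            continue
        total = sum((dx * dx + dy * dy) ** 0.5 for dx, dy in mvs)
        feats.append(total / len(mvs))
    return feats


def long_term_feature(key_a: Sequence[float], key_b: Sequence[float]) -> float:
    """Mean absolute difference between two adjacent key-frame descriptors."""
    if len(key_a) != len(key_b):
        raise ValueError("key-frame descriptors must have the same length")
    if not key_a:
        return 0.0
    return sum(abs(a - b) for a, b in zip(key_a, key_b)) / len(key_a)


def fuse(short: Sequence[float], long_value: float) -> List[float]:
    """Toy interaction: short-term mean and peak, long-term value, and their product."""
    mean = sum(short) / len(short) if short else 0.0
    peak = max(short) if short else 0.0
    return [mean, peak, long_value, peak * long_value]


# Made-up inputs: two predicted frames between two key frames.
mvs = [[(3.0, 4.0), (0.0, 0.0)], [(6.0, 8.0)]]
short = short_term_feature(mvs)
long_value = long_term_feature([0.0, 1.0], [1.0, 3.0])
features = fuse(short, long_value)
```

In a learned model, each hand-written statistic here would presumably be replaced by a trained branch; the sketch only shows the data flow under those assumptions.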

Original language: English
Title of host publication: UbiComp/ISWC 2021 - Adjunct Proceedings of the 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2021 ACM International Symposium on Wearable Computers
Publisher: Association for Computing Machinery, Inc
Pages: 625-630
Number of pages: 6
ISBN (Electronic): 9781450384612
DOIs
State: Published - 24 Sep 2021
Event: 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2021 ACM International Symposium on Wearable Computers, UbiComp/ISWC 2021 - Virtual, Online, United States
Duration: 21 Sep 2021 to 25 Sep 2021

Publication series

Name: UbiComp/ISWC 2021 - Adjunct Proceedings of the 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2021 ACM International Symposium on Wearable Computers

Conference

Conference: 2021 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2021 ACM International Symposium on Wearable Computers, UbiComp/ISWC 2021
Country/Territory: United States
City: Virtual, Online
Period: 21/09/21 to 25/09/21

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities

Keywords

  • Action recognition
  • Physical knowledge driven
  • Video compressed domain
  • Multi-scale temporal receptive field
