Exploring Action Centers for Temporal Action Localization

  • Xi'an Jiaotong University
  • Wormpex AI Research
  • University of Illinois at Chicago

Research output: Contribution to journal › Article › peer-review

27 Citations (Scopus)

Abstract

Temporal action localization aims at detecting the temporal intervals of human actions in untrimmed videos. Most previous methods rely on locating and matching the start and end times of actions. However, action boundaries are ambiguous and uncertain in nature, which leads to inaccurate action localization and a lot of false positives. In this paper, we introduce a new framework for temporal action localization. It explicitly models temporal action centers to reduce unreliable action detection results caused by ambiguous action boundaries. Since action centers are highly related to semantic actions, they can be detected more reliably than the conventional action boundaries. As a result, our framework can exclude false positives and promote high-quality proposals. Based on action centers, we propose a triplet feature fusion mechanism. It performs neural message passing among the boundaries and the center as well as contextual regions outside of the proposal to enrich its representation. In addition, we introduce a centerness scoring method to suppress proposals deviating from the centers of action instances. Consequently, our network can retrieve high-quality action proposals and locate actions more precisely. Experimental results show our method outperforms state-of-the-art methods on the THUMOS14 and ActivityNet v1.3 datasets.

Original language: English
Pages (from-to): 9425-9436
Number of pages: 12
Journal: IEEE Transactions on Multimedia
Volume: 25
DOI
Publication status: Published - 2023
