Video Self-Supervised Cross-Pathway Training Based on Slow and Fast Pathways

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the field of video self-supervised learning, contrastive instance learning methods suffer from a lack of semantic information, resulting in inadequate generalization in downstream tasks. Although optical flow can provide some semantic information, it requires significant computational cost prior to training. To address this, we propose a Video self-supervised Cross-pathway training model based on Slow and Fast pathways (VCSF). This model separately extracts temporal and spatial features from pure RGB video frames, and uses the complementary representations of the two pathways to conduct cross-pathway training. Additionally, we propose a motion perception module in the low-frame-rate space to enhance the network's ability to perceive rapidly changing human motion. We conducted extensive experiments in downstream missions of UCF101 and HMDB51, and obtained state-of-the-art results in models using the UCF101 data set for self-supervised pre-training, including motion recognition and nearest neighbor retrieval.

Original languageEnglish
Title of host publication2023 IEEE International Conference on Systems, Man, and Cybernetics
Subtitle of host publicationImproving the Quality of Life, SMC 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2048-2053
Number of pages6
ISBN (Electronic)9798350337020
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023 - Hybrid, Honolulu, United States
Duration: 1 Oct 20234 Oct 2023

Publication series

NameConference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
ISSN (Print)1062-922X

Conference

Conference2023 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2023
Country/TerritoryUnited States
CityHybrid, Honolulu
Period1/10/234/10/23

Keywords

  • Cross-pathway training
  • Self-supervised learning
  • Slow and Fast pathways

Fingerprint

Dive into the research topics of 'Video Self-Supervised Cross-Pathway Training Based on Slow and Fast Pathways'. Together they form a unique fingerprint.

Cite this