跳到主要导航 跳到搜索 跳到主要内容

GSLTA-CDFSAR: Global Sequences and Local Tuples Alignment for Cross-Domain Few-Shot Action Recognition

  • Xi'an Jiaotong University
  • Shenzhen Institute of Advanced Technology

科研成果: 期刊稿件文章同行评审

2 引用 (Scopus)

摘要

Few-shot action recognition (FSAR) has made substantial progress, however it primarily addresses problems within a single domain. Its effectiveness is often questioned when applied across different domains. This is mainly due to inductive biases in data distribution during the meta-training, including spatial and temporal distribution biases. These combined biases further complicate the adaptation issue in videos, making it challenging for models trained in one domain to adapt to another. In order to deal with this problem, we first enhance the source domain videos with frames from unlabeled target domain videos. Then, we employ a dual-branch structure to process the videos. The first branch, named the Domain Temporal branch, simultaneously handles global sequences of videos from both the source and target domains, while the second branch, named the Local-Global Adapter branch, compares local tuples of videos with global sequences from the source domain. We align the meta-learning results of the source domain from the first branch with that from the second branch, enabling us to obtain domain-invariant information solely from the source domain. Concurrently, in the first branch, we perform a reconstruction operation for the target domain videos, allowing the model to extract features that approach the target domain. Our code is available on: https://github.com/cofly2014/GSLTA.git.

源语言英语
文章编号113041
期刊Knowledge-Based Systems
311
DOI
出版状态已出版 - 28 2月 2025

学术指纹

探究 'GSLTA-CDFSAR: Global Sequences and Local Tuples Alignment for Cross-Domain Few-Shot Action Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此