跳到主要导航 跳到搜索 跳到主要内容

Space-time Prompting for Video Class-incremental Learning

  • Yixuan Pei
  • , Zhiwu Qing
  • , Shiwei Zhang
  • , Xiang Wang
  • , Yingya Zhang
  • , Deli Zhao
  • , Xueming Qian
  • Xi'an Jiaotong University
  • Huazhong University of Science and Technology
  • Alibaba Group Holding Ltd.
  • Ltd

科研成果: 书/报告/会议事项章节会议稿件同行评审

13 引用 (Scopus)

摘要

Recently, prompt-based learning has made impressive progress on image class-incremental learning, but it still lacks sufficient exploration in the video domain. In this paper, we will fill this gap by learning multiple prompts based on a powerful image-language pre-trained model, i.e., CLIP, making it fit for video class-incremental learning (VCIL). For this purpose, we present a space-time prompting approach (ST-Prompt) which contains two kinds of prompts, i.e., task-specific prompts and task-agnostic prompts. The task-specific prompts are to address the catastrophic forgetting problem by learning multi-grained prompts, i.e., spatial prompts, temporal prompts and comprehensive prompts, for accurate task identification. The task-agnostic prompts maintain a globally-shared prompt pool, which can empower the pre-trained image models with temporal perception abilities by exchanging contexts between frames. By this means, ST-Prompt can transfer the plentiful knowledge in the image-language pre-trained models to the VCIL task with only a tiny set of prompts to be optimized. To evaluate ST-Prompt, we conduct extensive experiments on three standard benchmarks. The results show that ST-Prompt can significantly surpass the state-of-the-art VCIL methods, especially it gains 9.06% on HMDB51 dataset under the 1 × 25 stage setting.

源语言英语
主期刊名Proceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
出版商Institute of Electrical and Electronics Engineers Inc.
11898-11908
页数11
ISBN(电子版)9798350307184
DOI
出版状态已出版 - 2023
活动2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Paris, 法国
期限: 2 10月 20236 10月 2023

出版系列

姓名Proceedings of the IEEE International Conference on Computer Vision
ISSN(印刷版)1550-5499

会议

会议2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
国家/地区法国
Paris
时期2/10/236/10/23

学术指纹

探究 'Space-time Prompting for Video Class-incremental Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此