跳到主要导航 跳到搜索 跳到主要内容

Knowledge Synergy Learning for Multi-Modal Tracking

  • Xi'an Jiaotong University
  • Shenzhen Institute of Advanced Technology

科研成果: 期刊稿件文章同行评审

17 引用 (Scopus)

摘要

Benefiting from the rich information provided by different modalities, multi-modal tracking has shown significant improvements compared to single-modal tracking. However, in practical applications, multi-modal tracking still faces two major challenges. Firstly, it is crucial to effectively integrate the complementary information from different modalities in order to improve tracking performance. Secondly, as trackers are often deployed in dynamic environments, it is difficult to ensure complete multi-modal data. Thus, handling modal-missing issues is essential to achieve robust and reliable tracking. To address these challenges, this paper proposes a Knowledge Synergy Network (KSNet) that integrates multi-modal features into a comprehensive representation and incorporates a modal compensation mechanism to handle modal-missing issues. With this framework, a multi-modal tracker (KSTrack) is built and trained using multi-modal data. KSTrack is capable of handling both complete and incomplete multi-modal data during inference. Comprehensive experiments on four large-scale RGB-Thermal (RGB-T) and RGB-Depth (RGB-D) benchmarks show that KSTrack surpasses state-of-the-art multi-modal trackers when using multi-modal data and outperforms single-modal trackers by a large margin when using single-modal data.

源语言英语
页(从-至)5519-5532
页数14
期刊IEEE Transactions on Circuits and Systems for Video Technology
34
7
DOI
出版状态已出版 - 2024

学术指纹

探究 'Knowledge Synergy Learning for Multi-Modal Tracking' 的科研主题。它们共同构成独一无二的指纹。

引用此