Boosting Few-Shot Remote Sensing Image Scene Classification with Language-Guided Multimodal Prompt Tuning

  • Haixia Bi
  • , Zhangwei Gao
  • , Kang Liu
  • , Qian Song
  • , Xiaotian Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Remote sensing image Scene classification is an important research topic in remote sensing community and has evoked a growing concern with the recent development of deep learning techniques. However, the requirement of a large amount of annotations brings great challenges to deep learning-based scene classification approaches. Visual-linguistic pretraining models, which improve the transferability of visual models using the supervision information of text, create a new way for the task under label scarcity scenario. In this paper, we explore the novel approach of prompt engineering, aiming to achieve satisfactory performance of multi-modal pretraining models on downstream remote sensing image scene classification task with minimal amounts of training data. Experiments were conducted on multiple publicly available datasets. The results indicate that training the learnable prompts with a small number of samples can yield impressive results, surpassing the few-shot transfer learning results of the best-performing pre-trained models.

Original languageEnglish
Title of host publicationProceedings of 2023 International Conference on New Trends in Computational Intelligence, NTCI 2023
EditorsJian Wang, Marios M. Polycarpou
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages293-297
Number of pages5
ISBN (Electronic)9798350380859
DOIs
StatePublished - 2023
Event2023 International Conference on New Trends in Computational Intelligence, NTCI 2023 - Qingdao, China
Duration: 3 Nov 20235 Nov 2023

Publication series

NameProceedings of 2023 International Conference on New Trends in Computational Intelligence, NTCI 2023

Conference

Conference2023 International Conference on New Trends in Computational Intelligence, NTCI 2023
Country/TerritoryChina
CityQingdao
Period3/11/235/11/23

Keywords

  • Few-shot learning
  • Multi-modal pretraining
  • Prompt tuning
  • Remote sensing image scene classification

Fingerprint

Dive into the research topics of 'Boosting Few-Shot Remote Sensing Image Scene Classification with Language-Guided Multimodal Prompt Tuning'. Together they form a unique fingerprint.

Cite this