Transformer-Based Zero-Shot Detection via Contrastive Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Zero-Shot Detection (ZSD) is a challenging computer vision problem that enables simultaneous classification and localization of previously unseen objects via auxiliary information. Most of the existing methods learn a biased visual-semantic mapping function, which prefers predicting seen classes during testing, and they only focus on region of interest and ignore contextual information in an image. To tackle these problems, we propose a novel framework for ZSD named Transformer-based Zero-Shot Detection via Contrastive Learning (TZSDC). The proposed TZSDC contains four components: transformer-based backbone, Foreground-Background (FB) separation module, Instance-Instance Contrastive Learning (IICL) module, and Knowledge-Transfer (KT) module. The transformer backbone encodes long-range contextual information with less inductive bias. The FB module separates foreground and background by scoring objectness from images. The IICL module optimizes the visual structure in embedding space to make it more discriminative and the KT module transfers knowledge from seen classes to unseen classes via category similarity. Benefiting from these modules, the accurate alignment between the contextual visual features and semantic features can be achieved. Experiments on MSCOCO well validate the effectiveness of the proposed method for ZSD and generalized ZSD.

Original languageEnglish
Title of host publicationArtificial Intelligence Applications and Innovations - 18th IFIP WG 12.5 International Conference, AIAI 2022, Proceedings
EditorsIlias Maglogiannis, Lazaros Iliadis, John Macintyre, Paulo Cortez
PublisherSpringer Science and Business Media Deutschland GmbH
Pages316-327
Number of pages12
ISBN (Print)9783031083327
DOIs
StatePublished - 2022
Event18th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations, AIAI 2022 - Hersonissos, Greece
Duration: 17 Jun 202220 Jun 2022

Publication series

NameIFIP Advances in Information and Communication Technology
Volume646 IFIP
ISSN (Print)1868-4238
ISSN (Electronic)1868-422X

Conference

Conference18th IFIP WG 12.5 International Conference on Artificial Intelligence Applications and Innovations, AIAI 2022
Country/TerritoryGreece
CityHersonissos
Period17/06/2220/06/22

Keywords

  • Contrastive learning
  • Transformer
  • Zero-Shot Detection

Fingerprint

Dive into the research topics of 'Transformer-Based Zero-Shot Detection via Contrastive Learning'. Together they form a unique fingerprint.

Cite this