Multi-class Token-Guided End-to-End Weakly Supervised Image Semantic Segmentation Method

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Weakly supervised image semantic segmentation has become one of the most popular approaches in recent years because of its low annotation cost, and it has been widely used in medical image segmentation, autonomous driving, remote sensing image analysis, and other fields. However, current transformer-based weakly supervised semantic segmentation methods have several problems: they focus on the whole image, ignore local details, and confuse different categories. To solve these problems, we propose a token-guided single-stage weakly supervised image semantic segmentation algorithm. First, to address the insufficient attention to details, we propose an optimized clipping method that selects as many uncertain regions as possible and marks them finely. Then, a single-class-token-to-multiple-class-tokens method is proposed to obtain multiple class tokens for fine-grained guidance. In particular, we design a multi-class token guidance method that classifies uncertain regions and activates them correctly. Quantitative and qualitative results on the public PASCAL VOC 2012 dataset validate the effectiveness of the method.

Original language: English
Title of host publication: Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Proceedings
Editors: Zhouchen Lin, Hongbin Zha, Ming-Ming Cheng, Ran He, Cheng-Lin Liu, Kurban Ubul, Wushouer Silamu, Jie Zhou
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 93-106
Number of pages: 14
ISBN (Print): 9789819784929
State: Published - 2025
Event: 7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024 - Urumqi, China
Duration: 18 Oct 2024 to 20 Oct 2024

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15043 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024
Country/Territory: China
City: Urumqi
Period: 18/10/24 to 20/10/24

Keywords

  • Multi-class tokens
  • Regional guidance
  • Semantic segmentation
  • Weak supervision
