TY - GEN
T1 - Multi-class Token-Guided End-to-End Weakly Supervised Image Semantic Segmentation Method
AU - Cao, Yifan
AU - He, Lijun
AU - Ma, Ting
AU - Li, Fan
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Weakly supervised image semantic segmentation has become the most popular method in recent years because of its low cost and has been widely used in medical image segmentation, automatic driving, remote sensing image analysis and other fields. However, the current weakly supervised semantic segmentation based on transformer has some problems, such as focusing on the whole, ignoring local details and confusing different categories. To solve these problems, we come up with a token-guided single stage weakly supervised image semantic segmentation algorithm. First of all, in order to solve the problem of insufficient attention to details, we proposed an optimization clipping method, which realized the selection of uncertain regions as much as possible and the fine marking of uncertain regions. Then, the single-class token to multiple class tokens method is purposed to obtain multiple class tokens for fine guidance. In particular, we designed a multiple class tokens guide method to complete the function of classifying uncertain regions and correctly activating them. The quantitative and qualitative results of the public dataset PASCAL VOC 2012 validate the effectiveness of the method.
AB - Weakly supervised image semantic segmentation has become the most popular method in recent years because of its low cost and has been widely used in medical image segmentation, automatic driving, remote sensing image analysis and other fields. However, the current weakly supervised semantic segmentation based on transformer has some problems, such as focusing on the whole, ignoring local details and confusing different categories. To solve these problems, we come up with a token-guided single stage weakly supervised image semantic segmentation algorithm. First of all, in order to solve the problem of insufficient attention to details, we proposed an optimization clipping method, which realized the selection of uncertain regions as much as possible and the fine marking of uncertain regions. Then, the single-class token to multiple class tokens method is purposed to obtain multiple class tokens for fine guidance. In particular, we designed a multiple class tokens guide method to complete the function of classifying uncertain regions and correctly activating them. The quantitative and qualitative results of the public dataset PASCAL VOC 2012 validate the effectiveness of the method.
KW - Multi-class tokens
KW - Regional guidance
KW - Semantic segmentation
KW - Weakly supervision
UR - https://www.scopus.com/pages/publications/85210002667
U2 - 10.1007/978-981-97-8493-6_7
DO - 10.1007/978-981-97-8493-6_7
M3 - 会议稿件
AN - SCOPUS:85210002667
SN - 9789819784929
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 93
EP - 106
BT - Pattern Recognition and Computer Vision - 7th Chinese Conference, PRCV 2024, Proceedings
A2 - Lin, Zhouchen
A2 - Zha, Hongbin
A2 - Cheng, Ming-Ming
A2 - He, Ran
A2 - Liu, Cheng-Lin
A2 - Ubul, Kurban
A2 - Silamu, Wushouer
A2 - Zhou, Jie
PB - Springer Science and Business Media Deutschland GmbH
T2 - 7th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2024
Y2 - 18 October 2024 through 20 October 2024
ER -