Learning graph structures with transformer for weakly supervised semantic segmentation

Wanchun Sun; Xin Feng; Hui Ma; Jingyao Liu

doi:10.1007/s40747-023-01152-x

Complex & Intelligent Systems (Jul 2023)

Learning graph structures with transformer for weakly supervised semantic segmentation

Wanchun Sun,
Xin Feng,
Hui Ma,
Jingyao Liu

Affiliations

Wanchun Sun: School of Computer Science and Technology, Changchun University of Science and Technology
Xin Feng: School of Computer Science and Technology, Changchun University of Science and Technology
Hui Ma: Computer Basic Teaching and Research Department, Anhui Vocational College of Police Officers
Jingyao Liu: School of Computer Science and Technology, Changchun University of Science and Technology

DOI: https://doi.org/10.1007/s40747-023-01152-x
Journal volume & issue: Vol. 9, no. 6
pp. 7511 – 7521

Abstract

Read online

Abstract Weakly supervised semantic segmentation (WSSS) is a challenging task of computer vision. The state-of-the-art semantic segmentation methods are usually based on the convolutional neural network (CNN), which mainly have the drawbacks of inability to explore the global information correctly and failure to activate potential object regions. To avoid such drawbacks, the transformer approach is explored in the WSSS task, but no effective semantic association between different patch tokens can be determined in the transformer. To address this issue, inspired by the graph convolutional network (GCN), this paper proposes a graph structure to learn the semantic category relationships between different blocks in the vector sequence. To verify the effectiveness of the proposed method in this paper, a large number of experiments were conducted on the publicly available PASCAL VOC2012 dataset. The experimental results show that our proposed method achieves significant performance improvement in the WSSS task and outperforms other state-of-the-art transformer-based methods.

Published in Complex & Intelligent Systems

ISSN: 2199-4536 (Print); 2198-6053 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://www.springer.com/journal/40747

About the journal

Abstract

Keywords