Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Weitao Liu; Junjun Wu

doi:10.1007/s44196-024-00630-5

International Journal of Computational Intelligence Systems (Nov 2024)

Channel2DTransformer: A Multi-level Features Self-attention Fusion Module for Semantic Segmentation

Weitao Liu,
Junjun Wu

Affiliations

Weitao Liu: Guangdong Provincial Key Laboratory of Industrial Intelligent Inspection Technology, Foshan University
Junjun Wu: Guangdong Provincial Key Laboratory of Industrial Intelligent Inspection Technology, Foshan University

DOI: https://doi.org/10.1007/s44196-024-00630-5
Journal volume & issue: Vol. 17, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Semantic segmentation is a crucial technology for intelligent vehicles, enabling scene understanding in complex driving environments. However, complex real-world scenarios often contain diverse multi-scale objects, which bring challenges to the accurate semantic segmentation. To address this challenge, we propose a multi-level features self-attention fusion module called Channel2DTransformer. The module utilizes self-attention mechanisms to dynamically fuse multi-level features by computing self-attention weights between their channels, resulting in a consistent and comprehensive representation of scene features. We perform the module on the Cityscapes and NYUDepthV2 datasets, which contain a large number of multi-scale objects. The experimental results validate the positive contributions of the module in enhancing the semantic segmentation accuracy of multi-scale objects and improving the performance of semantic segmentation in complex scenes.

Published in International Journal of Computational Intelligence Systems

ISSN: 1875-6891 (Print); 1875-6883 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.springer.com/journal/44196

About the journal

Abstract

Keywords