Network adaptation for color image semantic segmentation

Taeg‐Hyun An; Jungyu Kang; Kyoung‐Wook Min

doi:10.1049/ipr2.12846

IET Image Processing (Aug 2023)

Network adaptation for color image semantic segmentation

Taeg‐Hyun An,
Jungyu Kang,
Kyoung‐Wook Min

Affiliations

Taeg‐Hyun An: Electronics and Telecommunications Research Institute DaejeonSouth Korea
Jungyu Kang: Electronics and Telecommunications Research Institute DaejeonSouth Korea
Kyoung‐Wook Min: Electronics and Telecommunications Research Institute DaejeonSouth Korea

DOI: https://doi.org/10.1049/ipr2.12846
Journal volume & issue: Vol. 17, no. 10
pp. 2972 – 2983

Abstract

Read online

Abstract Image analysis using deep learning has made significant progress in the last few decades, and the importance of pre‐processing input images has become evident. However, adapting a network structure suitable for input images has not been considered. In this study, a simple network adaptation method for color image analysis is described. The method is illustrated using semantic segmentation, which mainly takes a color image as input. The method is inspired by chrominance subsampling, which is a practical method for image and video analysis. The human visual system is less sensitive to color differences than it is to brightness, and based on this phenomenon, it is possible to improve existing networks by providing less resolution to chroma information than luminance information in the network encoder design by applying the group convolution concept. The proposed method helps to achieve improved results without changing the complexity of the baseline network model, and is especially helpful in applications with limited resources, such as autonomous driving, augmented reality. Experiments were performed on a combination of datasets (i.e. CamVid, Cityscapes and KITTI‐360) and networks (i.e. ENet, ERFNet, Deeplabv3plus with mobilenetv2). The results show that the method improves the performance of existing network structures without increasing the number of parameters.

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667

About the journal

Abstract

Keywords