Superpixel Image Classification with Graph Convolutional Neural Networks Based on Learnable Positional Embedding

Ji-Hun Bae; Gwang-Hyun Yu; Ju-Hwan Lee; Dang Thanh Vu; Le Hoang Anh; Hyoung-Gook Kim; Jin-Young Kim

doi:10.3390/app12189176

Applied Sciences (Sep 2022)

Superpixel Image Classification with Graph Convolutional Neural Networks Based on Learnable Positional Embedding

Ji-Hun Bae,
Gwang-Hyun Yu,
Ju-Hwan Lee,
Dang Thanh Vu,
Le Hoang Anh,
Hyoung-Gook Kim,
Jin-Young Kim

Affiliations

Ji-Hun Bae: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea
Gwang-Hyun Yu: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea
Ju-Hwan Lee: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea
Dang Thanh Vu: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea
Le Hoang Anh: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea
Hyoung-Gook Kim: Department of Electronic Convergence Engineering, Kwangwoon University, 20 Gwangun-ro, Nowon-gu, Seoul 01897, Korea
Jin-Young Kim: Department of ICT Convergence System Engineering, Chonnam National University, 77 Yongbong-ro, Buk-gu, Gwangju 61186, Korea

DOI: https://doi.org/10.3390/app12189176
Journal volume & issue: Vol. 12, no. 18
p. 9176

Abstract

Read online

Graph convolutional neural networks (GCNNs) have been successfully applied to a wide range of problems, including low-dimensional Euclidean structural domains representing images, videos, and speech and high-dimensional non-Euclidean domains, such as social networks and chemical molecular structures. However, in computer vision, the existing GCNNs are not provided with positional information to distinguish between graphs of new structures; therefore, the performance of the image classification domain represented by arbitrary graphs is significantly poor. In this work, we introduce how to initialize the positional information through a random walk algorithm and continuously learn the additional position-embedded information of various graph structures represented over the superpixel images we choose for efficiency. We call this method the graph convolutional network with learnable positional embedding applied on images (IMGCN-LPE). We apply IMGCN-LPE to three graph convolutional models (the Chebyshev graph convolutional network, graph convolutional network, and graph attention network) to validate performance on various benchmark image datasets. As a result, although not as impressive as convolutional neural networks, the proposed method outperforms various other conventional convolutional methods and demonstrates its effectiveness among the same tasks in the field of GCNNs.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords