Yolo V4 for Advanced Traffic Sign Recognition With Synthetic Training Data Generated by Various GAN

Christine Dewi; Rung-Ching Chen; Yan-Ting Liu; Xiaoyi Jiang; Kristoko Dwi Hartomo

doi:10.1109/ACCESS.2021.3094201

IEEE Access (Jan 2021)

Affiliations

Christine Dewi: Department of Information Management, Chaoyang University of Technology, Taichung, Taiwan
Rung-Ching Chen: Department of Information Management, Chaoyang University of Technology, Taichung, Taiwan
Yan-Ting Liu: Department of Information Management, Chaoyang University of Technology, Taichung, Taiwan
Xiaoyi Jiang: Department of Mathematics and Computer Science, University of Münster, Münster, Germany
Kristoko Dwi Hartomo: Faculty of Information Technology, Satya Wacana Christian University, Salatiga, Central Java, Indonesia

DOI: https://doi.org/10.1109/ACCESS.2021.3094201
Journal volume & issue: Vol. 9
pp. 97228 – 97242

Abstract

Convolutional Neural Networks (CNNs) achieve excellent results in traffic sign identification when enough annotated training data is available, and the dataset determines the quality of the complete CNN-based visual system. Unfortunately, traffic sign databases are scarce for the majority of the world's nations. In this scenario, Generative Adversarial Networks (GANs) can be used to produce realistic and varied training images that supplement the real images. This research evaluates the quality of synthetic images created by DCGAN, LSGAN, and WGAN. Our work combines synthetic images with original images to enlarge the dataset and verify the effectiveness of the synthetic data. We train with different numbers and sizes of images, and use the Structural Similarity Index (SSIM) and Mean Square Error (MSE) to assess image quality. Our study quantifies the SSIM difference between the synthetic and real images. When more images are used for training, the synthetic images show a high degree of resemblance to the real images. The highest SSIM value was achieved with 200 total input images and an image size of $32\times 32$. Further, we augment the original image dataset with synthetic images and compare the original-image model with the synthetic-image model. For this experiment, we use the latest iterations of Yolo, namely Yolo V3 and Yolo V4. Mixing the real images with synthesized images produced by LSGAN improved recognition performance, achieving an accuracy of 84.9% on Yolo V3 and 89.33% on Yolo V4.
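As a rough illustration of the two image-quality measures named in the abstract, the sketch below computes MSE and a simplified, single-window SSIM between a real and a synthetic grayscale image. It is not the authors' evaluation code. The function names, the flat-list image representation, and the noisy stand-in for a GAN output are assumptions made for this example, and the constants 0.01 and 0.03 are the conventional SSIM defaults rather than values taken from the paper. SSIM is more commonly averaged over local windows, so this global version will generally give different numbers from those reported.

```python
import random


def mse(a, b):
    """Mean squared error between two equal-length pixel sequences."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def global_ssim(a, b, data_range=255.0):
    """Simplified SSIM computed over the whole image as one window.

    Assumption: this is the single-window form of the SSIM formula,
    not the locally windowed average used by most toolkits.
    """
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    n = len(a)
    # Stabilising constants (conventional defaults K1=0.01, K2=0.03).
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_a = sum(a) / n
    mu_b = sum(b) / n
    var_a = sum((x - mu_a) ** 2 for x in a) / n
    var_b = sum((y - mu_b) ** 2 for y in b) / n
    cov = sum((x - mu_a) * (y - mu_b) for x, y in zip(a, b)) / n
    numerator = (2 * mu_a * mu_b + c1) * (2 * cov + c2)
    denominator = (mu_a ** 2 + mu_b ** 2 + c1) * (var_a + var_b + c2)
    return numerator / denominator


# Hypothetical data: a random 32x32 grayscale "real" image, flattened,
# and a "synthetic" image simulated by adding Gaussian noise to it.
rng = random.Random(0)
real = [float(rng.randint(0, 255)) for _ in range(32 * 32)]
synthetic = [min(255.0, max(0.0, p + rng.gauss(0, 10))) for p in real]

print("MSE :", mse(real, synthetic))
print("SSIM:", global_ssim(real, synthetic))
```

Identical images give an MSE of 0 and an SSIM of 1. A lower MSE and an SSIM closer to 1 both indicate that the synthetic image is closer to the real one, which is the sense in which the abstract speaks of resemblance.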

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
