Review of Image Data Augmentation in Computer Vision

LIN Chengchuang, SHAN Chun, ZHAO Gansen, YANG Zhirong, PENG Jing, CHEN Shaojie, HUANG Runhua, LI Zhuangwei, YI Xusheng, DU Jiahua, LI Shuangyin, LUO Haoyu, FAN Xiaomao, CHEN Bingchuan

doi:10.3778/j.issn.1673-9418.2102015

Jisuanji kexue yu tansuo (May 2021)

Review of Image Data Augmentation in Computer Vision

LIN Chengchuang, SHAN Chun, ZHAO Gansen, YANG Zhirong, PENG Jing, CHEN Shaojie, HUANG Runhua, LI Zhuangwei, YI Xusheng, DU Jiahua, LI Shuangyin, LUO Haoyu, FAN Xiaomao, CHEN Bingchuan

Affiliations

LIN Chengchuang, SHAN Chun, ZHAO Gansen, YANG Zhirong, PENG Jing, CHEN Shaojie, HUANG Runhua, LI Zhuangwei, YI Xusheng, DU Jiahua, LI Shuangyin, LUO Haoyu, FAN Xiaomao, CHEN Bingchuan: 1. School of Computer Science, South China Normal University, Guangzhou 510631, China 2. School of Electronics and Information, Guangdong Polytechnic Normal University, Guangzhou 510665, China 3. Norwegian University of Science and Technology, Trondheim 17491, Norway 4. Key Lab on Cloud Security and Assessment Technology of Guangzhou, Guangzhou 510631, China 5. South China Normal University & VeChain Joint Lab on BlockChain Technology and Application, Guangzhou 510631, China 6. School of Statistics and Mathematics, Guangdong University of Finance and Economics, Guangzhou 510320, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2102015
Journal volume & issue: Vol. 15, no. 5
pp. 583 – 611

Abstract

Read online

Deep learning is a promising solution for computer vision at present. To solve the computer vision problem, it requires massive and high-quality image training datasets. Collecting and accurately labeling image datasets is a very time-consuming and expensive process. As computer vision applications become more widespread, it makes this problem even more pronounced. Image augmentation technologies are technical methods to effectively solve the problem of deep learning training under the condition of small-scale or low-quality training data. These technologies are continually accompanied with the development of deep learning and computer vision. This paper first reviews these image augmentation researches from the perspective of augmentation objects, operation spaces, label processing methods, and augmentation strategies and then concludes corresponding paradigms of current image data augmentation methods. After that, this paper proposes a taxonomy for current image data augmentation guided by the above paradigms, and reviews corresponding representative methods of each image data augmentation category. Finally, this paper makes conclusions on existing image data augmentation, points out the problems existing in the current image augmentation research and presents promising directions for future research.

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords