Survey of AI Painting

ZHANG Zeyu, WANG Tiejun, GUO Xiaoran, LONG Zhilei, XU Kui

doi:10.3778/j.issn.1673-9418.2401075

Jisuanji kexue yu tansuo (Jun 2024)

Survey of AI Painting

ZHANG Zeyu, WANG Tiejun, GUO Xiaoran, LONG Zhilei, XU Kui

Affiliations

ZHANG Zeyu, WANG Tiejun, GUO Xiaoran, LONG Zhilei, XU Kui: 1. Key Laboratory of China??s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, Lanzhou 730030, China 2. School of Mathematics and Computer Science, Northwest Minzu University, Lanzhou 730030, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2401075
Journal volume & issue: Vol. 18, no. 6
pp. 1404 – 1420

Abstract

Read online

AI painting, as a popular research direction in the field of computer vision, is expanding its application boundaries in the fields of art creation, film and media, industrial design, and art education through natural language processing, graphic pre-training models, and diffusion models. Two types of AI painting, namely, image-to-image and text-to-image, are taken as the main lines, and the representative models and their key technologies and methods are analyzed in depth. For the image-to-image, the development lineage, generation principle, and advantages and disadvantages of each model are explored from two types of models based on AE and GAN, and their effects on the public dataset are summarized. For the text-to-image, the structural differences of the three types of models based on diffusion model and other models, as well as the generation effects of various types of models on three datasets are summarized. It is pointed out that the text-to-image utilizing the diffusion model has become a hot topic nowadays, which predicts the diversified development of image generation in the future. And the current mainstream AI painting platforms are compared and summarized from the perspectives of usage and generation speed. Finally, on the basis of summarizing the problems and controversies faced by AI painting at the technical and social levels, future trends such as the complementary development of AI painting and human artists, the increased interactivity of the painting process, and the emergence of new professions and industries are envisioned.

ai painting; image-to-image; text-to-image; image generation; artificial intelligence generated content (aigc)

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords