Visual Thinking of Neural Networks: Interactive Text to Image Synthesis

Hyunhee Lee; Gyeongmin Kim; Yuna Hur; Heuiseok Lim

doi:10.1109/ACCESS.2021.3074973

IEEE Access (Jan 2021)

Visual Thinking of Neural Networks: Interactive Text to Image Synthesis

Hyunhee Lee,
Gyeongmin Kim,
Yuna Hur,
Heuiseok Lim

Affiliations

Hyunhee Lee: ORCiD; Department of Computer Science and Engineering, Korea University, Seoul, South Korea
Gyeongmin Kim: ORCiD; Department of Computer Science and Engineering, Korea University, Seoul, South Korea
Yuna Hur: ORCiD; Department of Computer Science and Engineering, Korea University, Seoul, South Korea
Heuiseok Lim: ORCiD; Department of Computer Science and Engineering, Korea University, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2021.3074973
Journal volume & issue: Vol. 9
pp. 64510 – 64523

Abstract

Read online

Reasoning, a trait of cognitive intelligence, is regarded as a crucial ability that distinguishes humans from other species. However, neural networks now pose a challenge to this human ability. Text-to-image synthesis is a class of vision and linguistics, wherein the goal is to learn multimodal representations between the image and text features. Hence, it requires a high-level reasoning ability that understands the relationships between objects in the given text and generates high-quality images based on the understanding. Text-to-image translation can be termed as the visual thinking of neural networks. In this study, our model infers the complicated relationships between objects in the given text and generates the final image by leveraging the previous history. We define diverse novel adversarial loss functions and finally demonstrate the best one that elevates the reasoning ability of the text-to-image synthesis. Remarkably, most of our models possess their own reasoning ability. Quantitative and qualitative comparisons with several methods demonstrate the superiority of our approach.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords