IEEE Access (Jan 2020)

An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy

  • Seitaro Shinagawa,
  • Koichiro Yoshino,
  • Seyed Hossein Alavi,
  • Kallirroi Georgila,
  • David Traum,
  • Sakriani Sakti,
  • Satoshi Nakamura

DOI
https://doi.org/10.1109/ACCESS.2020.2997012
Journal volume & issue
Vol. 8
pp. 98471 – 98480

Abstract

Read online

We propose an interactive image editing system that has a confirmation dialogue strategy using an entropy-based uncertainty calculation on its generated images with Deep Convolutional Generative Adversarial Networks (DCGAN). DCGAN is an image generative model that learns an image manifold of a given dataset and enables continuous change of an image. Our proposed image editing system combines DCGAN with a natural language interface that accepts image editing requests in natural language. Although such a system is helpful for human users, it often faces uncertain requests to generate acceptable images. A promising approach to solve this problem is introducing a dialogue process that shows multiple candidates and confirms the user's intention. However, confirming every editing request creates redundant dialogues. To achieve more efficient dialogues, we propose an entropy-based dialogue strategy that decides when the system should confirm, and enables effective image editing through a dialogue that reduces redundant confirmations. We conducted image editing dialogue experiments using an avatar face illustration dataset for editing by natural language requests. Through quantitative and qualitative analysis, our results show that our entropy-based confirmation strategy achieved an effective dialogue by generating images desired by users.

Keywords