Important Region Estimation Using Image Captioning

Taku Suzuki; Daisuke Sato; Yoshihiro Sugaya; Tomo Miyazaki; Shinichiro Omachi

doi:10.1109/ACCESS.2022.3211260

IEEE Access (Jan 2022)

Important Region Estimation Using Image Captioning

Taku Suzuki,
Daisuke Sato,
Yoshihiro Sugaya,
Tomo Miyazaki,
Shinichiro Omachi

Affiliations

Taku Suzuki: Graduate School of Engineering, Tohoku University, Sendai, Japan
Daisuke Sato: Graduate School of Engineering, Tohoku University, Sendai, Japan
Yoshihiro Sugaya: ORCiD; Faculty of Advanced Science and Technology, Ryukoku University, Otsu, Japan
Tomo Miyazaki: ORCiD; Graduate School of Engineering, Tohoku University, Sendai, Japan
Shinichiro Omachi: ORCiD; Graduate School of Engineering, Tohoku University, Sendai, Japan

DOI: https://doi.org/10.1109/ACCESS.2022.3211260
Journal volume & issue: Vol. 10
pp. 105546 – 105555

Abstract

Read online

When storing images and videos on a limited storage device or transmitting them over a narrow-band network, an effective approach is to detect the necessary parts and process them preferentially. Visual saliency has often been used for this purpose, and many methods have been proposed to detect salient objects. However, a salient object is not necessarily the primary subject in an image. Determining the important regions in an image is not clear or easy to achieve because it generally depends on the context of the image. In this study, we propose a novel framework for detecting important image regions. We leverage an image-captioning technique because it interprets the context of an image when generating sentences. The proposed method determines those important regions that are closer to the level of human sensitivity by exploiting semantic information from the image captioning. To evaluate the effectiveness of the proposed method, we created a dataset that defines important regions within images based on experiments using subjective evaluation. Applying this dataset, we confirmed that the accuracy of the proposed approach was higher than that of conventional saliency-based object detection methods.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords