Cross‐scale resolution consistent network for salient object detection

Xiaoyu Huang; Wei Liu; Minghui Li; Hangyu Nie

doi:10.1049/ipr2.13136

IET Image Processing (Aug 2024)

Affiliations

Xiaoyu Huang: School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan, China
Wei Liu: School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan, China
Minghui Li: School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan, China
Hangyu Nie: School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan, China

DOI: https://doi.org/10.1049/ipr2.13136
Journal volume & issue: Vol. 18, no. 10, pp. 2788–2799

Abstract

Salient object detection aims to simulate the human visual system by detecting the most eye‐catching objects or regions in an image. However, owing to the complexity of visual mechanisms, current methods suffer severe performance degradation when a model trained at a fixed resolution is directly evaluated at other resolutions, producing inconsistent predictions for the same regions. Because prediction consistency is essential for salient object detection, a cross‐scale resolution consistent salient object detection method, called RCNet, is proposed. Specifically, to improve the model's generalization across images of varying resolutions and let it implicitly learn scale invariance, a multi‐resolution data enhancement module is constructed to generate images of arbitrary resolution for the same scene. Moreover, to achieve better multi‐level feature fusion, a cross‐scale fusion module is developed to fuse high‐level semantic features with low‐level detail features. Additionally, to explicitly learn the scale invariance of the salient scores, a hybrid salient consistency loss is formulated over salient object detection results at different resolutions. Comprehensive evaluations on five benchmark datasets show that RCNet achieves highly competitive results.
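
The abstract does not give the form of the hybrid salient consistency loss, so the following is only a minimal sketch of the general idea of penalizing disagreement between predictions made at two resolutions of the same scene. It assumes a plain mean absolute difference and nearest-neighbour resizing as stand-ins; the names `resize_nearest` and `consistency_loss` are hypothetical and are not taken from RCNet.

```python
from typing import List

# A saliency map as a row-major grid of scores in [0, 1] (assumed layout).
Map = List[List[float]]


def resize_nearest(src: Map, out_h: int, out_w: int) -> Map:
    """Resize a 2-D map to (out_h, out_w) with nearest-neighbour sampling."""
    in_h, in_w = len(src), len(src[0])
    return [
        [src[r * in_h // out_h][c * in_w // out_w] for c in range(out_w)]
        for r in range(out_h)
    ]


def consistency_loss(pred_a: Map, pred_b: Map) -> float:
    """Mean absolute difference between two predictions of the same scene.

    pred_b is first resized to pred_a's resolution. This L1 term is an
    illustrative stand-in, not the loss defined in the paper.
    """
    h, w = len(pred_a), len(pred_a[0])
    pred_b_resized = resize_nearest(pred_b, h, w)
    total = sum(
        abs(pred_a[r][c] - pred_b_resized[r][c])
        for r in range(h)
        for c in range(w)
    )
    return total / (h * w)


# Toy predictions: a 2x2 map and a 4x4 upsampled copy of it.
low = [[0.0, 1.0], [1.0, 0.0]]
high = resize_nearest(low, 4, 4)
print(consistency_loss(high, low))
```

Here the two predictions agree once brought to a common resolution, so the penalty is zero; any region scored differently at the two resolutions would raise it. A practical version would more likely operate on tensors with a smoother interpolation.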

Published in IET Image Processing

ISSN: 1751-9659 (Print); 1751-9667 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Photography; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17519667
