IEEE Access (Jan 2024)

How Can Human Hierarchical and Active Visual Perceptual Behavior Influence Scenic Image Categorization? A Deep Vision Model for Architectural Scenery Toward Education

  • Xing Yuan,
  • Jun Wang,
  • Chaohui Pang

DOI
https://doi.org/10.1109/ACCESS.2024.3443608
Journal volume & issue
Vol. 12
pp. 112545 – 112557

Abstract

Read online

Exploring the complex meanings in architectural scenic images is crucial for modern intelligent systems. We propose a novel method combining multi-channel perceptual features to semantically understand scenes with intricate spatial structures. Our deep hierarchical model captures human gaze dynamics using a binary objectness measure to detect objects and details at various architectural scales. We introduce a multi-layer active learning paradigm to derive gaze shifting paths (GSPs) and calculate corresponding deep representations, tolerant to label noise via a penalty term that discards irrelevant GSP features. A manifold-regularized feature selector is used to choose high-quality GSP features, and a linear classifier distinguishes architectural scenes. Our technique outperforms traditional models, as shown in tests on three image retrieval datasets, demonstrating the discriminative capacity of our deep gaze-guided features.

Keywords