IET Image Processing (Sep 2023)
A multimodal feature fusion image dehazing method with scene depth prior
Abstract
Current dehazing networks usually learn haze features only in the colour space of a single image and often suffer from uneven dehazing, colour distortion, and edge degradation when confronted with ground objects at different scales in the depth space of the scene. The authors propose a multimodal feature fusion image dehazing method with a scene depth prior, built on an encoder–decoder backbone network. First, a multimodal feature fusion module is designed: an affine transformation and a polarized self-attention mechanism fuse image colour features with depth prior features, improving the model's ability to represent the haze features of ground objects at different scales in depth space. Then, a feature enhancement module (FEM) is added, in which deformable convolution and difference convolution strengthen the model's representation of the geometric and texture features of ground objects. Comparison and ablation experiments are conducted on publicly available dehazing datasets. The results show that, compared with existing classical dehazing networks, the proposed method significantly improves the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), dehazes more uniformly across different depth ranges, and preserves the colour and edge details of ground objects well.
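To make the fusion idea concrete, the following is a minimal sketch, not the authors' implementation: a depth prior predicts per-pixel affine parameters (scale and shift) that modulate the colour-branch features, followed by a simplified polarized-style attention with separate channel and spatial branches. All class, parameter, and layer names here are illustrative assumptions.

```python
import torch
import torch.nn as nn


class DepthAffineFusion(nn.Module):
    """Fuse colour features with a scene depth prior via affine modulation + attention (sketch)."""

    def __init__(self, channels: int):
        super().__init__()
        # Depth prior -> per-pixel scale (gamma) and shift (beta) for affine modulation
        self.to_gamma = nn.Conv2d(1, channels, kernel_size=3, padding=1)
        self.to_beta = nn.Conv2d(1, channels, kernel_size=3, padding=1)
        # Simplified polarized-style attention: channel-only branch + spatial-only branch
        self.channel_att = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // 2, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // 2, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        self.spatial_att = nn.Sequential(
            nn.Conv2d(channels, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )

    def forward(self, colour_feat: torch.Tensor, depth: torch.Tensor) -> torch.Tensor:
        # Affine transformation conditioned on the depth prior: F' = gamma * F + beta
        gamma = self.to_gamma(depth)
        beta = self.to_beta(depth)
        fused = gamma * colour_feat + beta
        # Re-weight the fused features along the channel and spatial dimensions
        fused = fused * self.channel_att(fused)
        fused = fused * self.spatial_att(fused)
        return fused


if __name__ == "__main__":
    feat = torch.randn(1, 64, 128, 128)   # colour-space features from the encoder
    depth = torch.rand(1, 1, 128, 128)    # estimated scene depth prior, normalised to [0, 1]
    out = DepthAffineFusion(64)(feat, depth)
    print(out.shape)  # torch.Size([1, 64, 128, 128])
```

The depth-conditioned affine step lets haze features at different depths be re-scaled differently, which is the intuition behind the more uniform dehazing reported in the abstract; the paper's actual module may differ in structure and detail.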
Keywords