Joint specular highlight detection and removal in single images via Unet-Transformer

Zhongqi Wu; Jianwei Guo; Chuanqing Zhuang; Jun Xiao; Dong-Ming Yan; Xiaopeng Zhang

doi:10.1007/s41095-022-0273-9

Computational Visual Media (Oct 2022)

Joint specular highlight detection and removal in single images via Unet-Transformer

Zhongqi Wu,
Jianwei Guo,
Chuanqing Zhuang,
Jun Xiao,
Dong-Ming Yan,
Xiaopeng Zhang

Affiliations

Zhongqi Wu: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Jianwei Guo: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Chuanqing Zhuang: The School of Artificial Intelligence, University of Chinese Academy of Sciences
Jun Xiao: The School of Artificial Intelligence, University of Chinese Academy of Sciences
Dong-Ming Yan: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
Xiaopeng Zhang: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences

DOI: https://doi.org/10.1007/s41095-022-0273-9
Journal volume & issue: Vol. 9, no. 1
pp. 141 – 154

Abstract

Read online

Abstract Specular highlight detection and removal is a fundamental problem in computer vision and image processing. In this paper, we present an efficient end-to-end deep learning model for automatically detecting and removing specular highlights in a single image. In particular, an encoder—decoder network is utilized to detect specular highlights, and then a novel Unet-Transformer network performs highlight removal; we append transformer modules instead of feature maps in the Unet architecture. We also introduce a highlight detection module as a mask to guide the removal task. Thus, these two networks can be jointly trained in an effective manner. Thanks to the hierarchical and global properties of the transformer mechanism, our framework is able to establish relationships between continuous self-attention layers, making it possible to directly model the mapping between the diffuse area and the specular highlight area, and reduce indeterminacy within areas containing strong specular highlight reflection. Experiments on public benchmark and real-world images demonstrate that our approach outperforms state-of-the-art methods for both highlight detection and removal tasks.

Published in Computational Visual Media

ISSN: 2096-0433 (Print); 2096-0662 (Online)
Publisher: SpringerOpen
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41095

About the journal

Abstract

Keywords