Component Identification and Depth Estimation for Structural Images Based on Multi-Scale Task Interaction Network

Jianlong Ye; Hongchuan Yu; Gaoyang Liu; Jiong Zhou; Jiangpeng Shu

doi:10.3390/buildings14040983

Buildings (Apr 2024)

Component Identification and Depth Estimation for Structural Images Based on Multi-Scale Task Interaction Network

Jianlong Ye,
Hongchuan Yu,
Gaoyang Liu,
Jiong Zhou,
Jiangpeng Shu

Affiliations

Jianlong Ye: Zhejiang Communications Group Inspection Technology, Co., Ltd., Hangzhou 310058, China
Hongchuan Yu: College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China
Gaoyang Liu: School of Civil Engineering, Shaoxing University, Shaoxing 312000, China
Jiong Zhou: Zhejiang Communications Group Inspection Technology, Co., Ltd., Hangzhou 310058, China
Jiangpeng Shu: College of Civil Engineering and Architecture, Zhejiang University, Hangzhou 310058, China

DOI: https://doi.org/10.3390/buildings14040983
Journal volume & issue: Vol. 14, no. 4
p. 983

Abstract

Read online

Component identification and depth estimation are important for detecting the integrity of post-disaster structures. However, traditional manual methods might be time-consuming, labor-intensive, and influenced by subjective judgments of inspectors. Deep-learning-based image visual inspection is a new approach to overcome these problems, but repeated modeling is required for different inspection tasks, which limits inspection accuracy and practical deployment efficiency. In this study, it is observed that the matched ratios of pixel pairs between component identification and depth estimation reach a high value, which indicates the dual tasks are highly related. Therefore, the Multi-Scale Task Interaction Network (MTI-Net) is proposed for structural images to simultaneously accomplish both tasks for accurate and efficient structural inspection. It propagates distilled task information from lower to higher scales. Then, it aggregates the refined task features from all scales to produce the final per-task predictions. Experiments show that MTI-Net delivers the full potential of multi-task learning, with a smaller memory footprint and higher efficiency compared to single-task learning. For the evaluation metrics of model performance, the mean Intersection over Union (mIoU) of component identification improves by 2.30, and root mean square error (RMSE) drops by 0.36 m with the aid of the multi-task strategy. The multi-task deep learning framework has great potential value in engineering applications.

Published in Buildings

ISSN: 2075-5309 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Building construction
Website: https://www.mdpi.com/journal/buildings

About the journal

Abstract

Keywords