Automatic extraction and 3D modeling of real road scenes using UAV imagery and deep learning semantic segmentation

Zhen Liu; Wenxiu Wu; Danyu Wang; Bingyan Cui; Xingyu Gu

doi:10.1080/17538947.2024.2365970

International Journal of Digital Earth (Dec 2024)

Automatic extraction and 3D modeling of real road scenes using UAV imagery and deep learning semantic segmentation

Zhen Liu,
Wenxiu Wu,
Danyu Wang,
Bingyan Cui,
Xingyu Gu

Affiliations

Zhen Liu: Department of Roadway Engineering, School of Transportation, Southeast University, Nanjing, People’s Republic of China
Wenxiu Wu: Highway and Transportation Management Center, Jinhua, People’s Republic of China
Danyu Wang: Department of Roadway Engineering, School of Transportation, Southeast University, Nanjing, People’s Republic of China
Bingyan Cui: Department of Civil and Environmental Engineering, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
Xingyu Gu: Department of Roadway Engineering, School of Transportation, Southeast University, Nanjing, People’s Republic of China

DOI: https://doi.org/10.1080/17538947.2024.2365970
Journal volume & issue: Vol. 17, no. 1

Abstract

Read online

The extraction of roads from UAV images is challenged by lighting, noise, occlusions, and similar non-road objects, making high-quality road extraction difficult. To addressing these issues, this study proposes an enhanced U-Net network to automate the extraction and 3D modeling of real road scenes using UAV imagery. Initially, a cascaded atrous spatial pyramid module was integrated into the encoder to capitalize on global context information, thereby refining the fuzzy segmentation outcomes. Subsequently, a module for augmenting road feature extraction was added within the channel, and a spatial attention mechanism was introduced in the decoder to enhance edge clarity. Experimental results demonstrated that this model captures more road information compared to mainstream networks and effectively incorporates topological structure perception for road extraction in complex scenarios, thus improving road connectivity. The model achieved an F1 score and mean Intersection over Union (mIoU) of 85.6% and 81.2%, respectively, on UAV images of road scenes – marking improvements of 3.9% and 3.4% over the traditional U-Net model, thereby exhibiting superior automatic road extraction capabilities. Ultimately, the model facilitated refined modeling and visual analysis of road scenes, achieving high overall accuracy and detailed local restoration of the actual scene.

Published in International Journal of Digital Earth

ISSN: 1753-8947 (Print); 1753-8955 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Geography. Anthropology. Recreation: Mathematical geography. Cartography
Website: https://www.tandfonline.com/journals/tjde

About the journal

Abstract

Keywords