Scientific Reports (Nov 2024)

Leveraging a deep learning generative model to enhance recognition of minor asphalt defects

  • Saúl Cano-Ortiz,
  • Eugenio Sainz-Ortiz,
  • Lara Lloret Iglesias,
  • Pablo Martínez Ruiz del Árbol,
  • Daniel Castro-Fresno

DOI
https://doi.org/10.1038/s41598-024-80199-3
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Deep learning-based computer vision systems have become powerful tools for automated and cost-effective pavement distress detection, essential for efficient road maintenance. Current methods focus primarily on developing supervised learning architectures, which are limited by the scarcity of annotated image datasets. The use of data augmentation with synthetic images created by generative models to improve these supervised systems is not widely explored. The few studies that do focus on generative architectures are mostly non-conditional, requiring extra labeling, and typically address only road crack defects while aiming to improve classification models rather than object detection. This study introduces AsphaltGAN, a novel class-conditional Generative Adversarial Network with attention mechanisms, designed to augment datasets with various rare road defects to enhance object detection. An in-depth analysis evaluates the impact of different loss functions and hyperparameter tuning. The optimized AsphaltGAN outperforms state-of-the-art generative architectures on public datasets. Additionally, a new workflow is proposed to improve object detection models using synthetic road images. The augmented datasets significantly improve the object detection metrics of You Only Look Once version 8 by 33.0 $$\%$$ , 3.8 $$\%$$ , 46.3 $$\%$$ , and 51.8 $$\%$$ on the Road Damage Detection 2022 dataset, Crack Dataset, Asphalt Pavement Detection Dataset, and Crack Surface Dataset, respectively.

Keywords