Detection and classification of surface defects on hot-rolled steel using vision transformers

Vinod Vasan; Naveen Venkatesh Sridharan; Sugumaran Vaithiyanathan; Mohammadreza Aghaei

Heliyon (Oct 2024)

Detection and classification of surface defects on hot-rolled steel using vision transformers

Vinod Vasan,
Naveen Venkatesh Sridharan,
Sugumaran Vaithiyanathan,
Mohammadreza Aghaei

Affiliations

Vinod Vasan: School of Mechanical Engineering (SMEC), Vellore Institute of Technology Chennai Campus, Vandalur Kelambakkam Road, Chennai, 600127, India
Naveen Venkatesh Sridharan: Division of Operation and Maintenance Engineering, Luleå University of Technology, 97187, Luleå, Sweden
Sugumaran Vaithiyanathan: School of Mechanical Engineering (SMEC), Vellore Institute of Technology Chennai Campus, Vandalur Kelambakkam Road, Chennai, 600127, India
Mohammadreza Aghaei: Department of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology (NTNU), 6009, Ålesund, Norway; Department of Sustainable Systems Engineering (INATECH), University of Freiburg, 79110, Freiburg, Germany; Corresponding author. Department of Ocean Operations and Civil Engineering, Norwegian University of Science and Technology (NTNU), 6009, Ålesund, Norway.

Journal volume & issue: Vol. 10, no. 19
p. e38498

Abstract

Read online

This study proposes a vision transformer to detect visual defects on steel surfaces. The proposed approach utilizes an open-source image dataset to classify steel surface conditions into six fault categories namely, crazing, inclusion, rolled in, pitted surface, scratches and patches. The defect images are first subject to resizing and then fed into a vision transformer subject to different hyperparameter configurations to determine the most optimal setting to render highest classification performance. The performance of the model is evaluated for different hyperparameter configurations, and the most optimal configuration is examined using the associated confusion matrices. It was observed that the proposed model presents a high overall accuracy of 96.39 % for detection and classification of steel surface faults. The study presents a descriptive insight into the vision transformer architecture and in addition, compares the performance of the current model with the results of other approaches suggested for application in literature. Vision transformers can serve as standalone approaches and suitable alternatives to the widely used convolution neural networks (CNNs) by actuating complex defect detection and classification tasks in real-time, enabling efficient and robust condition monitoring of a wide range of defects.

Published in Heliyon

ISSN: 2405-8440 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General); Social Sciences: Social sciences (General)
Website: https://www.cell.com/heliyon/home

About the journal

Abstract

Keywords