An Empirical Study on the Effect of Training Data Perturbations on Neural Network Robustness

Jie Wang; Zili Wu; Minyan Lu; Jun Ai

doi:10.3390/s24154874

Sensors (Jul 2024)

An Empirical Study on the Effect of Training Data Perturbations on Neural Network Robustness

Jie Wang,
Zili Wu,
Minyan Lu,
Jun Ai

Affiliations

Jie Wang: The Key Laboratory on Reliability and Environment Engineering Technology, School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China
Zili Wu: CRRC Zhuzhou Institute Co., Ltd., Zhuzhou 412001, China
Minyan Lu: The Key Laboratory on Reliability and Environment Engineering Technology, School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China
Jun Ai: The Key Laboratory on Reliability and Environment Engineering Technology, School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China

DOI: https://doi.org/10.3390/s24154874
Journal volume & issue: Vol. 24, no. 15
p. 4874

Abstract

Read online

The vulnerability of modern neural networks to random noise and deliberate attacks has raised concerns about their robustness, particularly as they are increasingly utilized in safety- and security-critical applications. Although recent research efforts were made to enhance robustness through retraining with adversarial examples or employing data augmentation techniques, a comprehensive investigation into the effects of training data perturbations on model robustness remains lacking. This paper presents the first extensive empirical study investigating the influence of data perturbations during model retraining. The experimental analysis focuses on both random and adversarial robustness, following established practices in the field of robustness analysis. Various types of perturbations in different aspects of the dataset are explored, including input, label, and sampling distribution. Single-factor and multi-factor experiments are conducted to assess individual perturbations and their combinations. The findings provide insights into constructing high-quality training datasets for optimizing robustness and recommend the appropriate degree of training set perturbations that balance robustness and correctness, and contribute to understanding model robustness in deep learning and offer practical guidance for enhancing model performance through perturbed retraining, promoting the development of more reliable and trustworthy deep learning systems for safety-critical applications.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords