Machine Learning-Based Fuzz Testing Techniques: A Survey

Ao Zhang; Yiying Zhang; Yao Xu; Cong Wang; Siwei Li

doi:10.1109/ACCESS.2023.3347652

IEEE Access (Jan 2024)

Machine Learning-Based Fuzz Testing Techniques: A Survey

Ao Zhang,
Yiying Zhang,
Yao Xu,
Cong Wang,
Siwei Li

Affiliations

Ao Zhang: ORCiD; College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, China
Yiying Zhang: ORCiD; College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, China
Yao Xu: ORCiD; College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, China
Cong Wang: ORCiD; College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin, China
Siwei Li: State Grid Information and Telecommunication Company Ltd., Beijing, China

DOI: https://doi.org/10.1109/ACCESS.2023.3347652
Journal volume & issue: Vol. 12
pp. 14437 – 14454

Abstract

Read online

Fuzz testing is a vulnerability discovery technique that tests the robustness of target programs by providing them with unconventional data. With the rapid increase in software quantity, scale and complexity, traditional fuzzing has revealed issues such as incomplete logic coverage, low automation level and insufficient test cases. Machine learning, with its exceptional capabilities in data analysis and classification prediction, presents a promising approach for improve fuzzing. This paper investigates the latest research results in fuzzing and provides a systematic review of machine learning-based fuzzing techniques. Firstly, by outlining the workflow of fuzzing, it summarizes the optimization of different stages of fuzzing using machine learning. Specifically, it focuses on the application of machine learning in the preprocessing phase, test case generation phase, input selection phase and result analysis phase. Secondly, it mentally focuses on the optimization methods of machine learning in the process of mutation, generation and filtering of test cases and compares and analyzes its technical principles. Furthermore, it analyzes the performance gains brought by applying machine learning techniques to fuzzing, mainly including coverage, vulnerability detection capability, efficiency and effectiveness of test cases. Lastly, it concludes by summarizing the challenges and difficulties in combining machine learning with fuzzing and presents prospects for future trends in this field.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords