A Systematic Mapping of the Proposition of Benchmarks in the Software Testing and Debugging Domain

Deuslirio da Silva-Junior; Valdemar V. Graciano-Neto; Diogo M. de-Freitas; Plinio de Sá Leitão-Junior; Mohamad Kassab

doi:10.3390/software2040021

Software (Oct 2023)

A Systematic Mapping of the Proposition of Benchmarks in the Software Testing and Debugging Domain

Deuslirio da Silva-Junior,
Valdemar V. Graciano-Neto,
Diogo M. de-Freitas,
Plinio de Sá Leitão-Junior,
Mohamad Kassab

Affiliations

Deuslirio da Silva-Junior: Instituto de Informática, Universidade Federal de Goiás, Goiânia 74690-900, Goiás, Brazil
Valdemar V. Graciano-Neto: Instituto de Informática, Universidade Federal de Goiás, Goiânia 74690-900, Goiás, Brazil
Diogo M. de-Freitas: Instituto de Informática, Universidade Federal de Goiás, Goiânia 74690-900, Goiás, Brazil
Plinio de Sá Leitão-Junior: Instituto de Informática, Universidade Federal de Goiás, Goiânia 74690-900, Goiás, Brazil
Mohamad Kassab: Engineering Division, The Pennsylvania State University, Malvern, PA 16801, USA

DOI: https://doi.org/10.3390/software2040021
Journal volume & issue: Vol. 2, no. 4
pp. 447 – 475

Abstract

Read online

Software testing and debugging are standard practices of software quality assurance since they enable the identification and correction of failures. Benchmarks have been used in that context as a group of programs to support the comparison of different techniques according to pre-established parameters. However, the reasons that inspire researchers to propose novel benchmarks are not fully understood. This article reports the investigation, identification, classification, and externalization of the state of the art about the proposition of benchmarks on software testing and debugging domains. The study was carried out using systematic mapping procedures according to the guidelines widely followed by software engineering literature. The search identified 1674 studies, from which, 25 were selected for analysis. A list of benchmarks is provided and descriptively mapped according to their characteristics, motivations, and scope of use for their creation. The lack of data to support the comparison between available and novel software testing and debugging techniques is the main motivation for the proposition of benchmarks. Advancements in the standardization and prescription of benchmark structure and composition are still required. Establishing such a standard could foster benchmark reuse, thereby saving time and effort in the engineering of benchmarks for software testing and debugging.

Published in Software

ISSN: 2674-113X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.mdpi.com/journal/software

About the journal

Abstract

Keywords