Analysing Agreement Among Different Evaluators in God Class and Feature Envy Detection

Khalid Alkharabsheh; Sadi Alawadi; Yania Crespo; M. Esperanza Manso; Jose A. Taboada Gonzalez

doi:10.1109/ACCESS.2021.3123123

IEEE Access (Jan 2021)

Analysing Agreement Among Different Evaluators in God Class and Feature Envy Detection

Khalid Alkharabsheh,
Sadi Alawadi,
Yania Crespo,
M. Esperanza Manso,
Jose A. Taboada Gonzalez

Affiliations

Khalid Alkharabsheh: ORCiD; Department of Software Engineering, Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University (BAU), As-Salt, Jordan
Sadi Alawadi: ORCiD; Department of Information Technology, Uppsala University, Uppsala, Sweden
Yania Crespo: ORCiD; Departamento de Informática, Escuela de Ingeniería Informática, Universidad de Valladolid, Campus Miguel Delibes, Valladolid, Spain
M. Esperanza Manso: Departamento de Informática, Escuela de Ingeniería Informática, Universidad de Valladolid, Campus Miguel Delibes, Valladolid, Spain
Jose A. Taboada Gonzalez: ORCiD; CiTIUS, Centro Singular de Investigación en Tecnoloxías da Información, Universidad de Santiago de Compostela, Santiago de Compostela, Spain

DOI: https://doi.org/10.1109/ACCESS.2021.3123123
Journal volume & issue: Vol. 9
pp. 145191 – 145211

Abstract

Read online

The automatic detection of Design Smells has evolved in parallel to the evolution of automatic refactoring tools. There was a huge rise in research activity regarding Design Smell detection from 2010 to the present. However, it should be noted that the adoption of Design Smell detection in real software development practice is not comparable to the adoption of automatic refactoring tools. On the basis of the assumption that it is the objectiveness of a refactoring operation as opposed to the subjectivity in definition and identification of Design Smells that makes the difference, in this paper, the lack of agreement between different evaluators when detecting Design Smells is empirically studied. To do so, a series of experiments and studies were designed and conducted to analyse the concordance in Design Smell detection of different persons and tools, including a comparison between them. This work focuses on two well known Design Smells: God Class and Feature Envy. Concordance analysis is based on the Kappa statistic for inter-rater agreement (particularly Kappa-Fleiss). The results obtained show that there is no agreement in detection in general, and, in those cases where a certain agreement appears, it is considered to be a fair or poor degree of agreement, according to a Kappa-Fleiss interpretation scale. This seems to confirm that there is a subjective component which makes the raters evaluate the presence of Design Smells differently. The study also raises the question of a lack of training and experience regarding Design Smells.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords