Studia Universitatis Babes-Bolyai: Series Informatica (Jul 2021)
Detecting the Most Important Classes from Software Systems with Self Organizing Maps
Abstract
Self Organizing Maps (SOM) are unsupervised neural networks suited for visualisation purposes and clustering analysis. This study uses SOM to solve a software engineering problem: detecting the most important (key) classes from software projects. Key classes are meant to link the most valuable concepts of a software system and in general these are found in the solution documentation. UML models created in the design phase become deprecated in time and tend to be a source of confusion for large legacy software. Therefore, developers try to reconstruct class diagrams from the source code using reverse engineering. However, the resulting diagram is often very cluttered and difficult to understand. There is an interest for automatic tools for building concise class diagrams, but the machine learning possibilities are not fully explored at the moment. This paper proposes two possible algorithms to transform SOM in a classification algorithm to solve this task, which involves separating the important classes - that should be on the diagrams - from the others, less important ones. Moreover, SOM is a reliable visualization tool which able to provide an insight about the structure of the analysed projects. Received by the editors: 24 May 2021. 2010 Mathematics Subject Classification. 68T05, 68T20. 1998 CR Categories and Descriptors. I.2.6 [Artificial Intelligence]: Learning - Connectionism and neural nets; D.2.2 [Software Engineering]: Design Tools and Techniques - Computer-aided software engineering (CASE).
Keywords