Applied Sciences (May 2022)

Empirical Analysis of Forest Penalizing Attribute and Its Enhanced Variations for Android Malware Detection

  • Abimbola G. Akintola,
  • Abdullateef O. Balogun,
  • Luiz Fernando Capretz,
  • Hammed A. Mojeed,
  • Shuib Basri,
  • Shakirat A. Salihu,
  • Fatima E. Usman-Hamza,
  • Peter O. Sadiku,
  • Ghaniyyat B. Balogun,
  • Zubair O. Alanamu

DOI
https://doi.org/10.3390/app12094664
Journal volume & issue
Vol. 12, no. 9
p. 4664

Abstract

Read online

As a result of the rapid advancement of mobile and internet technology, a plethora of new mobile security risks has recently emerged. Many techniques have been developed to address the risks associated with Android malware. The most extensively used method for identifying Android malware is signature-based detection. The drawback of this method, however, is that it is unable to detect unknown malware. As a consequence of this problem, machine learning (ML) methods for detecting and classifying malware applications were developed. The goal of conventional ML approaches is to improve classification accuracy. However, owing to imbalanced real-world datasets, the traditional classification algorithms perform poorly in detecting malicious apps. As a result, in this study, we developed a meta-learning approach based on the forest penalizing attribute (FPA) classification algorithm for detecting malware applications. In other words, with this research, we investigated how to improve Android malware detection by applying empirical analysis of FPA and its enhanced variants (Cas_FPA and RoF_FPA). The proposed FPA and its enhanced variants were tested using the Malgenome and Drebin Android malware datasets, which contain features gathered from both static and dynamic Android malware analysis. Furthermore, the findings obtained using the proposed technique were compared with baseline classifiers and existing malware detection methods to validate their effectiveness in detecting malware application families. Based on the findings, FPA outperforms the baseline classifiers and existing ML-based Android malware detection models in dealing with the unbalanced family categorization of Android malware apps, with an accuracy of 98.94% and an area under curve (AUC) value of 0.999. Hence, further development and deployment of FPA-based meta-learners for Android malware detection and other cybersecurity threats is recommended.

Keywords