Similarity-Based Malware Classification Using Graph Neural Networks

Yu-Hung Chen; Jiann-Liang Chen; Ren-Feng Deng

doi:10.3390/app122110837

Applied Sciences (Oct 2022)

Affiliations

Yu-Hung Chen: Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei 106335, Taiwan
Jiann-Liang Chen: Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei 106335, Taiwan
Ren-Feng Deng: Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei 106335, Taiwan

DOI: https://doi.org/10.3390/app122110837
Journal volume & issue: Vol. 12, no. 21, p. 10837

Abstract

This work proposes a novel malware identification model based on a graph neural network (GNN). Each malware sample is analyzed to extract its function call relationships and the assembly content of its functions, which together form a graph representing the functional structure of the sample. In addition to a multi-class classification model that predicts the malware family, this work implements a similarity model based on Siamese networks, which measures the distance between two samples in the feature space to determine whether they belong to the same malware family. During training, the distance between samples is adjusted gradually to improve performance. On a Malware Bazaar dataset, the proposed classification model achieves an accuracy of 0.934 and an area under the curve (AUC) of 0.997, and the proposed similarity model achieves an accuracy and an AUC of 0.92 each. Further, the similarity model identifies unseen malware families with approximately 70% accuracy. Hence, the proposed similarity model exhibits better performance and scalability than the pure classification model and previous studies.
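
The distance-based decision described in the abstract can be illustrated with a minimal sketch. The abstract does not name the loss function, the distance metric, or the decision threshold, so the contrastive loss, the Euclidean distance, and the `margin` and `threshold` values below are assumptions chosen for illustration, not the authors' configuration. The embedding vectors stand in for the outputs of the shared GNN encoder, which is not reproduced here.

```python
import math


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def contrastive_loss(a, b, same, margin=1.0):
    """Contrastive loss for one pair of embeddings.

    Assumed loss: the abstract only says that distances are adjusted
    during training. Pairs from the same family are pulled together;
    pairs from different families are pushed apart until they are at
    least `margin` (a hypothetical value) away from each other.
    """
    d = euclidean_distance(a, b)
    if same:
        return d ** 2
    return max(0.0, margin - d) ** 2


def is_same_family(a, b, threshold=0.5):
    """Predict family membership by thresholding the embedding distance.

    `threshold` is a hypothetical value; in practice it would be tuned
    on validation pairs.
    """
    return euclidean_distance(a, b) < threshold
```

Because the decision depends only on the distance between embeddings, a sample from a family absent at training time can still be compared against a known reference sample, which is one plausible reading of why the similarity model extends to unseen families.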

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci
