DeepMal: maliciousness-Preserving adversarial instruction learning against static malware detection

Chun Yang; Jinghui Xu; Shuangshuang Liang; Yanna Wu; Yu Wen; Boyang Zhang; Dan Meng

doi:10.1186/s42400-021-00079-5

Cybersecurity (May 2021)

DeepMal: maliciousness-Preserving adversarial instruction learning against static malware detection

Chun Yang,
Jinghui Xu,
Shuangshuang Liang,
Yanna Wu,
Yu Wen,
Boyang Zhang,
Dan Meng

Affiliations

Chun Yang: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Jinghui Xu: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Shuangshuang Liang: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Yanna Wu: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Yu Wen: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Boyang Zhang: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan
Dan Meng: Institute of Information Engineering (IIE), Chinese Academy of Sciences (CAS), North of Yiyuan

DOI: https://doi.org/10.1186/s42400-021-00079-5
Journal volume & issue: Vol. 4, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Outside the explosive successful applications of deep learning (DL) in natural language processing, computer vision, and information retrieval, there have been numerous Deep Neural Networks (DNNs) based alternatives for common security-related scenarios with malware detection among more popular. Recently, adversarial learning has gained much focus. However, unlike computer vision applications, malware adversarial attack is expected to guarantee malwares’ original maliciousness semantics. This paper proposes a novel adversarial instruction learning technique, DeepMal, based on an adversarial instruction learning approach for static malware detection. So far as we know, DeepMal is the first practical and systematical adversarial learning method, which could directly produce adversarial samples and effectively bypass static malware detectors powered by DL and machine learning (ML) models while preserving attack functionality in the real world. Moreover, our method conducts small-scale attacks, which could evade typical malware variants analysis (e.g., duplication check). We evaluate DeepMal on two real-world datasets, six typical DL models, and three typical ML models. Experimental results demonstrate that, on both datasets, DeepMal can attack typical malware detectors with the mean F1-score and F1-score decreasing maximal 93.94% and 82.86% respectively. Besides, three typical types of malware samples (Trojan horses, Backdoors, Ransomware) prove to preserve original attack functionality, and the mean duplication check ratio of malware adversarial samples is below 2.0%. Besides, DeepMal can evade dynamic detectors and be easily enhanced by learning more dynamic features with specific constraints.

Published in Cybersecurity

ISSN: 2523-3246 (Online)
Publisher: SpringerOpen
Country of publisher: Singapore
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://cybersecurity.springeropen.com/

About the journal

Abstract

Keywords