Can machine learning model with static features be fooled

an adversarial machine learning approach

Journal article (2020)

Authors

Rahim Taheri Shiraz University of Technology

Reza Javidan Shiraz University of Technology

Mohammad Shojafar Università degli Studi di Padova

P. Vinod Università degli Studi di Padova

M. Conti Università degli Studi di Padova

Affiliation

External organisation

Generative adversarial network Adversarial machine learning Android malware detection Jacobian algorithm Poison attacks

More Info

expand_more

To reference this document use:

http://resolver.tudelft.nl/uuid:6359a773-d123-41a8-a4d6-d65133ca83d9

Published Date

01-12-2020

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Affiliation

External organisation

Abstract

The widespread adoption of smartphones dramatically increases the risk of attacks and the spread of mobile malware, especially on the Android platform. Machine learning-based solutions have been already used as a tool to supersede signature-based anti-malware systems. However, malware authors leverage features from malicious and legitimate samples to estimate statistical difference in-order to create adversarial examples. Hence, to evaluate the vulnerability of machine learning algorithms in malware detection, we propose five different attack scenarios to perturb malicious applications (apps). By doing this, the classification algorithm inappropriately fits the discriminant function on the set of data points, eventually yielding a higher misclassification rate. Further, to distinguish the adversarial examples from benign samples, we propose two defense mechanisms to counter attacks. To validate our attacks and solutions, we test our model on three different benchmark datasets. We also test our methods using various classifier algorithms and compare them with the state-of-the-art data poisoning method using the Jacobian matrix. Promising results show that generated adversarial samples can evade detection with a very high probability. Additionally, evasive variants generated by our attack models when used to harden the developed anti-malware system improves the detection rate up to 50% when using the generative adversarial network (GAN) method.