Picking on the family: Disrupting android malware triage by forcing misclassification

Calleja Cortiñas, Alejandro; Martín, Alejandro; Menéndez, Héctor D.; Estévez Tapiador, Juan Manuel; Clark, David

Publication:
Picking on the family: Disrupting android malware triage by forcing misclassification

Identifiers

URI: https://hdl.handle.net/10016/33920

ISSN: 0957-4174

DOI: https://doi.org/10.1016/j.eswa.2017.11.032

UXXI: AR/0000020961

Files

picking_EXWA_2018.pdf (1.86 MB)

Publication date

2018-04-01

Authors

Calleja Cortiñas, Alejandro

Martín, Alejandro

Menéndez, Héctor D.

Estévez Tapiador, Juan Manuel

Clark, David

Publisher

Elsevier

Impact

Export

Abstract

Machine learning classification algorithms are widely applied to different malware analysis problems because of their proven abilities to learn from examples and perform relatively well with little human input. Use cases include the labelling of malicious samples according to families during triage of suspected malware. However, automated algorithms are vulnerable to attacks. An attacker could carefully manipulate the sample to force the algorithm to produce a particular output. In this paper we discuss one such attack on Android malware classifiers. We design and implement a prototype tool, called lagoDroid, that takes as input a malware sample and a target family, and modifies the sample to cause it to be classified as belonging to this family while preserving its original semantics. Our technique relies on a search process that generates variants of the original sample without modifying their semantics. We tested lagoDroid against RevealDroid, a recent, open source, Android malware classifier based on a variety of static features. IagoDroid successfully forces misclassification for 28 of the 29 representative malware families present in the DREBIN dataset. Remarkably, it does so by modifying just a single feature of the original malware. On average, it finds the first evasive sample in the first search iteration, and converges to a 100% evasive population within 4 iterations. Finally, we introduce RevealDroid*, a more robust classifier that implements several techniques proposed in other adversarial learning domains. Our experiments suggest that RevealDroid* can correctly detect up to 99% of the variants generated by lagoDroid. (C) 2017 The Authors. Published by Elsevier Ltd.

Keywords

malware classification, adversarial learning, genetic algorithms, Iagodroid

Bibliographic citation

Calleja, A., Martín, A., Menéndez, H.D., Tapiador, J., Clark, D. (2018). Picking on the family: Disrupting android malware triage by forcing misclassification. Expert Systems With Applications, 95, pp. 113-126. https://doi.org/10.1016/j.eswa.2017.11.032

Collections

DI - COSEC - Artículos de Revistas
OpenAIRE: Open Access Infrastructure for Research in Europe

Full item page

Publication:
Picking on the family: Disrupting android malware triage by forcing misclassification

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication: Picking on the family: Disrupting android malware triage by forcing misclassification

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication:
Picking on the family: Disrupting android malware triage by forcing misclassification