Abstract
© 2019 John Wiley & Sons, Ltd.Modern malware evolves various detection avoidance techniques to bypass the state-of-the-art detection methods. An emerging trend to deal with this issue is the combination of image transformation and machine learning models to classify and detect malware. However, existing works in this field only perform simple image transformation methods. These simple transformations have not considered color encoding and pixel rendering techniques on the performance of machine learning classifiers. In this article, we propose a novel approach to encoding and arranging bytes from binary files into images. These developed images contain statistical (eg, entropy) and syntactic artifacts (eg, strings), and their pixels are filled up using space-filling curves. Thanks to these features, our encoding method surpasses existing methods demonstrated by extensive experiments. In particular, our proposed method achieved 93.01% accuracy using the combination of the entropy encoding and character class scheme on the Hilbert curve.
| Original language | English |
|---|---|
| Article number | e3789 |
| Journal | Transactions on Emerging Telecommunications Technologies |
| Volume | 31 |
| Issue number | 11 |
| DOIs | |
| Publication status | Published - 1 Nov 2020 |
| Externally published | Yes |
Funding
The authors wish to thank the anonymous reviewers for their helpful comments. Duc‐Ly Vu and Fabio Massacci have partial received funding from the European Union's Horizon 2020 research and innovation program under grant 675320 (NeCS: European Network for Cyber Security). The authors wish to thank the anonymous reviewers for their helpful comments. Duc-Ly Vu and Fabio Massacci have partial received funding from the European Union's Horizon 2020 research and innovation program under grant 675320 (NeCS: European Network for Cyber Security).
| Funders | Funder number |
|---|---|
| European Network for Cyber Security | |
| European Union's Horizon 2020 | |
| European Union's Horizon 2020 research and innovation program | |
| Fabio Massacci | |
| Horizon 2020 Framework Programme | 675320 |