Decoding the Secrets of Machine Learning in Windows Malware Classification: A Deep Dive into Datasets, Features, and Model Performance

This repository hosts the feature importance scores of the following experiments:

Binary detection

Static

All samples
Packed only
No packed

Dynamic

All samples

Family classification

Static

All samples
Packed only
No packed

Dynamic

All samples

Citation

If you use any of the contents, please cite it as:

@inproceedings{dambra2023decoding,
  title={Decoding the secrets of machine learning in malware classification: A deep dive into datasets, feature extraction, and model performance},
  author={Dambra, Savino and Han, Yufei and Aonzo, Simone and Kotzias, Platon and Vitale, Antonino and Caballero, Juan and Balzarotti, Davide and Bilge, Leyla},
  booktitle={Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security},
  pages={60--74},
  year={2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Decoding the Secrets of Machine Learning in Windows Malware Classification: A Deep Dive into Datasets, Features, and Model Performance

Citation

Files

README.md

Latest commit

History

README.md

File metadata and controls

Decoding the Secrets of Machine Learning in Windows Malware Classification: A Deep Dive into Datasets, Features, and Model Performance

Citation