Prescience: Probabilistic Guidance on the Retraining Conundrum for Malware Detection

Submitted by grigby1 on Mon, 06/05/2017 - 12:33pm

Title	Prescience: Probabilistic Guidance on the Retraining Conundrum for Malware Detection
Publication Type	Conference Paper
Year of Publication	2016
Authors	Deo, Amit, Dash, Santanu Kumar, Suarez-Tangil, Guillermo, Vovk, Volodya, Cavallaro, Lorenzo
Conference Name	Proceedings of the 2016 ACM Workshop on Artificial Intelligence and Security
Publisher	ACM
Conference Location	New York, NY, USA
ISBN Number	978-1-4503-4573-6
Keywords	artificial intelligence security, composability, concept drift, Human Behavior, malware detection, Metrics, probabilistic prediction, pubcrawl, Resiliency
Abstract	Malware evolves perpetually and relies on increasingly so- phisticated attacks to supersede defense strategies. Data-driven approaches to malware detection run the risk of becoming rapidly antiquated. Keeping pace with malware requires models that are periodically enriched with fresh knowledge, commonly known as retraining. In this work, we propose the use of Venn-Abers predictors for assessing the quality of binary classification tasks as a first step towards identifying antiquated models. One of the key benefits behind the use of Venn-Abers predictors is that they are automatically well calibrated and offer probabilistic guidance on the identification of nonstationary populations of malware. Our framework is agnostic to the underlying classification algorithm and can then be used for building better retraining strategies in the presence of concept drift. Results obtained over a timeline-based evaluation with about 90K samples show that our framework can identify when models tend to become obsolete.
URL	http://doi.acm.org/10.1145/2996758.2996769
DOI	10.1145/2996758.2996769
Citation Key	deo_prescience:_2016

Groups:

Science of Security VO