Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator

Submitted by aekwall on Tue, 04/27/2021 - 1:29pm

Title	Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator
Publication Type	Conference Paper
Year of Publication	2020
Authors	Pozdniakov, K., Alonso, E., Stankovic, V., Tam, K., Jones, K.
Conference Name	2020 International Conference on Cyber Situational Awareness, Data Analytics and Assessment (CyberSA)
Keywords	Adaptation models, audit, Biological neural networks, Deep Neural Network, Human Behavior, Penetration Testing, pentesting, pubcrawl, q-learning, reinforcement learning, Resiliency, Scalability, Security Audits, Task Analysis, Tools
Abstract	A significant challenge in modern computer security is the growing skill gap as intruder capabilities increase, making it necessary to begin automating elements of penetration testing so analysts can contend with the growing number of cyber threats. In this paper, we attempt to assist human analysts by automating a single host penetration attack. To do so, a smart agent performs different attack sequences to find vulnerabilities in a target system. As it does so, it accumulates knowledge, learns new attack sequences and improves its own internal penetration testing logic. As a result, this agent (AgentPen for simplicity) is able to successfully penetrate hosts it has never interacted with before. A computer security administrator using this tool would receive a comprehensive, automated sequence of actions leading to a security breach, highlighting potential vulnerabilities, and reducing the amount of menial tasks a typical penetration tester would need to execute. To achieve autonomy, we apply an unsupervised machine learning algorithm, Q-learning, with an approximator that incorporates a deep neural network architecture. The security audit itself is modelled as a Markov Decision Process in order to test a number of decision-making strategies and compare their convergence to optimality. A series of experimental results is presented to show how this approach can be effectively used to automate penetration testing using a scalable, i.e. not exhaustive, and adaptive approach.
DOI	10.1109/CyberSA49311.2020.9139683
Citation Key	pozdniakov_smart_2020

Groups:

Science of Security VO