A Game-Theoretical Approach to Cyber-Security of Critical Infrastructures Based on Multi-Agent Reinforcement Learning

Submitted by grigby1 on Thu, 09/05/2019 - 11:28am

Title	A Game-Theoretical Approach to Cyber-Security of Critical Infrastructures Based on Multi-Agent Reinforcement Learning
Publication Type	Conference Paper
Year of Publication	2018
Authors	Panfili, M., Giuseppi, A., Fiaschetti, A., Al-Jibreen, H. B., Pietrabissa, A., Delli Priscoli, F.
Conference Name	2018 26th Mediterranean Conference on Control and Automation (MED)
ISBN Number	978-1-5386-7890-9
Keywords	Aerospace electronics, attack strategy, attack-defense problem, composability, composable security, Control Strategy, control theory, critical infrastructure, critical infrastructure protection, critical infrastructures, cyber-physical system defense, cyber-security, damage possible, European Project ATENA, game theory, game-theoretical approach, Games, learning (artificial intelligence), multi-agent systems, multiagent general sum game, multiagent reinforcement learning, Nash equilibrium, optimal security configuration, optimal trade-off between prevention actions, protected CI, pubcrawl, reinforcement learning, resilience, Resiliency, security, security of data, simulation results, stochastic games, system vulnerabilities, Vulnerability Management, zero-sum variant
Abstract	This paper presents a control strategy for Cyber-Physical System defense, developed in the framework of the European Project ATENA, which concerns Critical Infrastructure (CI) protection. The aim of the controller is to find the optimal security configuration, in terms of countermeasures to implement, to address the system vulnerabilities. The attack/defense problem is modeled as a multi-agent general-sum game in which the defender aims to prevent as much damage as possible by finding an optimal trade-off between prevention actions and their costs. The problem is solved using Reinforcement Learning, and simulation results provide a proof of concept, showing how the defender of the protected CI is able to minimize the damage caused by his/her opponents by finding the Nash equilibrium of the game in the zero-sum variant and, in a more general scenario, by driving the attacker into a position where the damage she/he can cause to the infrastructure is lower than the cost she/he has to sustain to enforce her/his attack strategy.
URL	https://ieeexplore.ieee.org/document/8442695
DOI	10.1109/MED.2018.8442695
Citation Key	panfili_game-theoretical_2018
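
Illustrative note: the abstract refers to the Nash equilibrium of the zero-sum variant of the attack/defense game. The sketch below shows what such an equilibrium looks like for a toy 2x2 game. It is not the paper's method: the paper solves the game with Reinforcement Learning, whereas this sketch substitutes the textbook closed-form solution for 2x2 zero-sum matrix games. The function name `solve_2x2_zero_sum` and the loss values are hypothetical and chosen only for illustration.

```python
from fractions import Fraction


def solve_2x2_zero_sum(loss):
    """Solve a 2x2 zero-sum game given as a defender-loss matrix.

    loss[i][j] is the damage suffered by the defender (and gained by the
    attacker) when the defender plays row i and the attacker plays column j.
    The defender minimizes the loss, the attacker maximizes it.

    Returns (p, q, value), where p is the probability that the defender
    plays row 0, q is the probability that the attacker plays column 0,
    and value is the expected loss at equilibrium.
    """
    (a, b), (c, d) = loss

    # Pure-strategy check: a saddle point exists when the defender's
    # best guaranteed loss equals the attacker's best guaranteed gain.
    row_max = [max(a, b), max(c, d)]   # worst case for each defender row
    col_min = [min(a, c), min(b, d)]   # worst case for each attacker column
    upper = min(row_max)
    lower = max(col_min)
    if upper == lower:
        i = row_max.index(upper)
        j = col_min.index(lower)
        return (Fraction(1 if i == 0 else 0),
                Fraction(1 if j == 0 else 0),
                Fraction(upper))

    # No saddle point: each player mixes so that the opponent is
    # indifferent between its two actions.
    denom = a - b - c + d
    p = Fraction(d - c, denom)
    q = Fraction(d - b, denom)
    value = Fraction(a * d - b * c, denom)
    return p, q, value


# Hypothetical losses: rows are two defender configurations,
# columns are two attacker strategies.
p, q, value = solve_2x2_zero_sum([[1, 4], [3, 2]])
print(p, q, value)  # 1/4 1/2 5/2
```

With these made-up numbers, neither player has a safe pure strategy, so the defender plays the first configuration a quarter of the time and the expected damage at equilibrium is 5/2, regardless of which strategy the attacker picks.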
