DeepBLOC: A Framework for Securing CPS through Deep Reinforcement Learning on Stochastic Games
Title | DeepBLOC: A Framework for Securing CPS through Deep Reinforcement Learning on Stochastic Games |
Publication Type | Conference Paper |
Year of Publication | 2020 |
Authors | Tahsini, A., Dunstatter, N., Guirguis, M., Ahmed, C. M. |
Conference Name | 2020 IEEE Conference on Communications and Network Security (CNS) |
Keywords | Biological system modeling, composability, CPS Modeling and Simulation, delays, Games, machine learning, predictability, Predictive Metrics, pubcrawl, Resiliency, Scalability, security, Security Heuristics, Sensors, Stochastic processes |
Abstract | One important aspect of protecting a Cyber-Physical System (CPS) is ensuring that the proper control and measurement signals are propagated within the control loop. The CPS research community has been developing a large set of check blocks that can be integrated within the control loop to check signals against various types of attacks (e.g., false data injection attacks). Unfortunately, it is not possible to integrate all of these checks within the control loop, as the overhead introduced when checking signals may violate the delay constraints of the control loop. Moreover, these blocks do not operate in complete isolation from each other, as dependencies exist among them in terms of their effectiveness in detecting a subset of attacks. Thus, assigning the proper checks becomes a challenging and complex problem, especially in the presence of a rational adversary who can observe the assigned check blocks and optimize her own attack strategies accordingly. This paper tackles the inherent state-action space explosion that arises in securing CPS by developing DeepBLOC (DB), a framework in which Deep Reinforcement Learning algorithms are utilized to provide optimal or sub-optimal assignments of check blocks to signals. The framework models stochastic games between the adversary and the CPS defender and derives mixed strategies for assigning check blocks to ensure the integrity of the propagated signals while abiding by the real-time constraints dictated by the control loop. Through extensive simulation experiments and a real implementation on a water purification system, we show that DB achieves assignment strategies that outperform other strategies and heuristics. |
DOI | 10.1109/CNS48642.2020.9162219 |
Citation Key | tahsini_deepbloc_2020 |