Title | Mixed Initiative Balance of Human-Swarm Teaming in Surveillance via Reinforcement learning |
Publication Type | Conference Paper |
Year of Publication | 2021 |
Authors | Xu, Chengtao, Song, Houbing |
Conference Name | 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC) |
Date Published | oct |
Keywords | Complexity theory, Computational modeling, decision making, human factors, Human-Swarm Teaming, Mixed Initiative Balance, pubcrawl, reinforcement learning, resilience, Resiliency, Resource management, Robot Trust, Robust Persistent Surveillance, simulation, surveillance |
Abstract | Human-machine teaming (HMT) operates in a context defined by the mission. Varying from the complexity and disturbance in the cooperation between humans and machines, a single machine has difficulty handling work with humans in the scales of efficiency and workload. Swarm of machines provides a more feasible solution in such a mission. Human-swarm teaming (HST) extends the concept of HMT in the mission, such as persistent surveillance, search-and-rescue, warfare. Bringing the concept of HST faces several scientific challenges. For example, the strategies of allocation on the high-level decision making. Here, human usually plays the supervisory or decision making role. Performance of such fixed structure of HST in actual mission operation could be affected by the supervisor's status from many aspects, which could be considered in three general parts: workload, situational awareness, and trust towards the robot swarm teammate and mission performance. Besides, the complexity of a single human operator in accessing multiple machine agents increases the work burdens. An interface between swarm teammates and human operators to simplify the interaction process is desired in the HST.In this paper, instead of purely considering the workload of human teammates, we propose the computational model of human swarm interaction (HSI) in the simulated map surveillance mission. UAV swarm and human supervisor are both assigned in searching a predefined area of interest (AOI). The workload allocation of map monitoring is adjusted based on the status of the human worker and swarm teammate. Workload, situation awareness ability, trust are formulated as independent models, which affect each other. A communication-aware UAV swarm persistent surveillance algorithm is assigned in the swarm autonomy portion. With the different surveillance task loads, the swarm agent's thrust parameter adjusts the autonomy level to fit the human operator's needs. Reinforcement learning is applied in seeking the relative balance of workload in both human and swarm sides. Metrics such as mission accomplishment rate, human supervisor performance, mission performance of UAV swarm are evaluated in the end. The simulation results show that the algorithm could learn the human-machine trust interaction to seek the workload balance to reach better mission execution performance. This work inspires us to leverage a more comprehensive HST model in more practical HMT application scenarios. |
DOI | 10.1109/DASC52595.2021.9594355 |
Citation Key | xu_mixed_2021 |