Proximal Policy Based Deep Reinforcement Learning Approach for Swarm Robots

Submitted by grigby1 on Fri, 03/25/2022 - 11:44am

Title	Proximal Policy Based Deep Reinforcement Learning Approach for Swarm Robots
Publication Type	Conference Paper
Year of Publication	2021
Authors	Tan, Ziya, Karaköse, Mehmet
Conference Name	2021 Zooming Innovation in Consumer Technologies Conference (ZINC)
Date Published	may
Keywords	artificial intelligence, composability, compositionality, Deep Learning, deep reinforcement learning, pubcrawl, reinforcement learning, Service robots, swarm behavior, swarm intelligence, swarm robotics, Task Analysis, Technological innovation, Training
Abstract	Artificial intelligence technology is becoming more active in all areas of our lives day by day. This technology affects our daily life by more developing in areas such as industry 4.0, security and education. Deep reinforcement learning is one of the most developed algorithms in the field of artificial intelligence. In this study, it is aimed that three different robots in a limited area learn to move without hitting each other, fixed obstacles and the boundaries of the field. These robots have been trained using the deep reinforcement learning approach and Proximal policy optimization (PPO) policy. Instead of uses value-based methods with the discrete action space, PPO that can easily manipulate the continuous action field and successfully determine the action of the robots has been proposed. PPO policy achieves successful results in multi-agent problems, especially with the use of the Actor-Critic network. In addition, information is given about environment control and learning approaches for swarm behavior. We propose parameter sharing and behavior-based method for this study. Finally, trained model is recorded and tested in 9 different environments where the obstacles are located differently. With our method, robots can perform their tasks in closed environments in the real world without damaging anyone or anything.
DOI	10.1109/ZINC52049.2021.9499288
Citation Key	tan_proximal_2021

Groups:

Science of Security VO