Title | Proximal Policy Based Deep Reinforcement Learning Approach for Swarm Robots |
Publication Type | Conference Paper |
Year of Publication | 2021 |
Authors | Tan, Ziya, Karaköse, Mehmet |
Conference Name | 2021 Zooming Innovation in Consumer Technologies Conference (ZINC) |
Date Published | may |
Keywords | artificial intelligence, composability, compositionality, Deep Learning, deep reinforcement learning, pubcrawl, reinforcement learning, Service robots, swarm behavior, swarm intelligence, swarm robotics, Task Analysis, Technological innovation, Training |
Abstract | Artificial intelligence technology is becoming more active in all areas of our lives day by day. This technology affects our daily life by more developing in areas such as industry 4.0, security and education. Deep reinforcement learning is one of the most developed algorithms in the field of artificial intelligence. In this study, it is aimed that three different robots in a limited area learn to move without hitting each other, fixed obstacles and the boundaries of the field. These robots have been trained using the deep reinforcement learning approach and Proximal policy optimization (PPO) policy. Instead of uses value-based methods with the discrete action space, PPO that can easily manipulate the continuous action field and successfully determine the action of the robots has been proposed. PPO policy achieves successful results in multi-agent problems, especially with the use of the Actor-Critic network. In addition, information is given about environment control and learning approaches for swarm behavior. We propose parameter sharing and behavior-based method for this study. Finally, trained model is recorded and tested in 9 different environments where the obstacles are located differently. With our method, robots can perform their tasks in closed environments in the real world without damaging anyone or anything. |
DOI | 10.1109/ZINC52049.2021.9499288 |
Citation Key | tan_proximal_2021 |