Title | Evaluating the Performance of Various Deep Reinforcement Learning Algorithms for a Conversational Chatbot |
Publication Type | Conference Paper |
Year of Publication | 2021 |
Authors | Rajamalli Keerthana, R, Fathima, G, Florence, Lilly |
Conference Name | 2021 2nd International Conference for Emerging Technology (INCET) |
Keywords | Attention Model, BiRNN, chatbot, conversational agents, Deep Learning, deep reinforcement learning, DQN, Human Behavior, Industries, Measurement, Metrics, NMT, pubcrawl, q-learning, QR-DQN, Recurrent neural networks, reinforcement learning, Scalability, software libraries |
Abstract | Conversational agents are the most popular AI technology in IT trends. Domain specific chatbots are now used by almost every industry in order to upgrade their customer service. The Proposed paper shows the modelling and performance of one such conversational agent created using deep learning. The proposed model utilizes NMT (Neural Machine Translation) from the TensorFlow software libraries. A BiRNN (Bidirectional Recurrent Neural Network) is used in order to process input sentences that contain large number of tokens (20-40 words). In order to understand the context of the input sentence attention model is used along with BiRNN. The conversational models usually have one drawback, that is, they sometimes provide irrelevant answer to the input. This happens quite often in conversational chatbots as the chatbot doesn't realize that it is answering without context. This drawback is solved in the proposed system using Deep Reinforcement Learning technique. Deep reinforcement Learning follows a reward system that enables the bot to differentiate between right and wrong answers. Deep Reinforcement Learning techniques allows the chatbot to understand the sentiment of the query and reply accordingly. The Deep Reinforcement Learning algorithms used in the proposed system is Q-Learning, Deep Q Neural Network (DQN) and Distributional Reinforcement Learning with Quantile Regression (QR-DQN). The performance of each algorithm is evaluated and compared in this paper in order to find the best DRL algorithm. The dataset used in the proposed system is Cornell Movie-dialogs corpus and CoQA (A Conversational Question Answering Challenge). CoQA is a large dataset that contains data collected from 8000+ conversations in the form of questions and answers. The main goal of the proposed work is to increase the relevancy of the chatbot responses and to increase the perplexity of the conversational chatbot. |
DOI | 10.1109/INCET51464.2021.9456321 |
Citation Key | rajamalli_keerthana_evaluating_2021 |