Performance Evaluation of Classification Algorithms in the Design of Apache Spark based Intrusion Detection System
Title | Performance Evaluation of Classification Algorithms in the Design of Apache Spark based Intrusion Detection System |
Publication Type | Conference Paper |
Year of Publication | 2020 |
Authors | reddy, S. V. Siva, Saravanan, S. |
Conference Name | 2020 5th International Conference on Communication and Electronics Systems (ICCES) |
Date Published | June 2020 |
Publisher | IEEE |
ISBN Number | 978-1-7281-5371-1 |
Keywords | Big Data, Chi-square, Intrusion Detection System (IDS), Knowledge Discovery in Database (KDD), Metrics, pubcrawl, resilience, Resiliency, Scalability, security, support vector machine (SVM) |
Abstract | Information security is a process of securing data from security breaches, hackers. The program of intrusion detection is a software framework that keeps tracking and analyzing the data in the network to identify the attacks by using traditional techniques. These traditional intrusion techniques work very efficient when it uses on small data. but when the same techniques used for big data, process of analyzing the data properties take long time and become not efficient and need to use the big data technologies like Apache Spark, Hadoop, Flink etc. to design modern Intrusion Detection System (IDS). In this paper, the design of Apache Spark and classification algorithm-based IDS is presented and employed Chi-square as a feature selection method for selecting the features from network security events data. The performance of Logistic Regression, Decision Tree and SVM is evaluated with SGD in the design of Apache Spark based IDS with AUROC and AUPR used as metrics. Also tabulated the training and testing time of each algorithm and employed NSL-KDD dataset for designing all our experiments. |
URL | https://ieeexplore.ieee.org/document/9138066 |
DOI | 10.1109/ICCES48766.2020.9138066 |
Citation Key | reddy_performance_2020 |