Title | Clustering Analysis for Big Data in Network Security Domain Using a Spark-Based Method |
Publication Type | Conference Paper |
Year of Publication | 2020 |
Authors | Xu, Hui, Zhang, Wei, Gao, Man, Chen, Hongwei |
Conference Name | 2020 IEEE 5th International Symposium on Smart and Wireless Systems within the Conferences on Intelligent Data Acquisition and Advanced Computing Systems (IDAACS-SWS) |
Date Published | Sept. 2020 |
Publisher | IEEE |
ISBN Number | 978-1-7281-9960-3 |
Keywords | Arrays, Big Data, Clustering algorithms, clustering analysis, Communication networks, compositionality, Distributed databases, intelligent data, k-means, Network security, Network Security Architecture, pubcrawl, resilience, Resiliency, Scalability, security, Spark, Sparks |
Abstract | Considering the problem of network security under the background of big data, the clustering analysis algorithms can be utilized to improve the correctness of network intrusion detection models for security management. As a kind of iterative clustering analysis algorithm, K-means algorithm is not only simple but also efficient, so it is widely used. However, the traditional K-means algorithm cannot well solve the network security problem when facing big data due to its high complexity and limited processing ability. In this case, this paper proposes to optimize the traditional K-means algorithm based on the Spark platform and deploy the optimized clustering analysis algorithm in the distributed architecture, so as to improve the efficiency of clustering algorithm for network intrusion detection in big data environment. The experimental result shows that, compared with the traditional K-means algorithm, the efficiency of the optimized K-means algorithm using a Spark-based method is significantly improved in the running time. |
URL | https://ieeexplore.ieee.org/document/9297106 |
DOI | 10.1109/IDAACS-SWS50031.2020.9297106 |
Citation Key | xu_clustering_2020 |