Visible to the public Clustering Analysis for Big Data in Network Security Domain Using a Spark-Based Method

TitleClustering Analysis for Big Data in Network Security Domain Using a Spark-Based Method
Publication TypeConference Paper
Year of Publication2020
AuthorsXu, Hui, Zhang, Wei, Gao, Man, Chen, Hongwei
Conference Name2020 IEEE 5th International Symposium on Smart and Wireless Systems within the Conferences on Intelligent Data Acquisition and Advanced Computing Systems (IDAACS-SWS)
Date PublishedSept. 2020
PublisherIEEE
ISBN Number978-1-7281-9960-3
KeywordsArrays, Big Data, Clustering algorithms, clustering analysis, Communication networks, compositionality, Distributed databases, intelligent data, k-means, Network security, Network Security Architecture, pubcrawl, resilience, Resiliency, Scalability, security, Spark, Sparks
AbstractConsidering the problem of network security under the background of big data, the clustering analysis algorithms can be utilized to improve the correctness of network intrusion detection models for security management. As a kind of iterative clustering analysis algorithm, K-means algorithm is not only simple but also efficient, so it is widely used. However, the traditional K-means algorithm cannot well solve the network security problem when facing big data due to its high complexity and limited processing ability. In this case, this paper proposes to optimize the traditional K-means algorithm based on the Spark platform and deploy the optimized clustering analysis algorithm in the distributed architecture, so as to improve the efficiency of clustering algorithm for network intrusion detection in big data environment. The experimental result shows that, compared with the traditional K-means algorithm, the efficiency of the optimized K-means algorithm using a Spark-based method is significantly improved in the running time.
URLhttps://ieeexplore.ieee.org/document/9297106
DOI10.1109/IDAACS-SWS50031.2020.9297106
Citation Keyxu_clustering_2020