Visible to the public Big Data and its Analyzing Tools : A Perspective

TitleBig Data and its Analyzing Tools : A Perspective
Publication TypeConference Paper
Year of Publication2020
AuthorsJaiswal, Ayshwarya, Dwivedi, Vijay Kumar, Yadav, Om Prakash
Conference Name2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS)
Date Publishedmar
KeywordsBIGDATA, Bull-eye, cryptography, Fault tolerance, Fault tolerant systems, Hadoop, HDFS, Human Behavior, Java, Kerberos, Kerberos Mechanism, Metrics, pubcrawl, Resiliency, security, Spark, Sparks, Storms, Tools
AbstractData are generated and stored in databases at a very high speed and hence it need to be handled and analyzed properly. Nowadays industries are extensively using Hadoop and Spark to analyze the datasets. Both the frameworks are used for increasing processing speeds in computing huge complex datasets. Many researchers are comparing both of them. Now, the big questions arising are, Is Spark a substitute for Hadoop? Is hadoop going to be replaced by spark in mere future?. Spark is "built on top of" Hadoop and it extends the model to deploy more types of computations which incorporates Stream Processing and Interactive Queries. No doubt, Spark's execution speed is much faster than Hadoop, but talking in terms of fault tolerance, hadoop is slightly more fault tolerant than spark. In this article comparison of various bigdata analytics tools are done and Hadoop and Spark are discussed in detail. This article further gives an overview of bigdata, spark and hadoop issues. In this survey paper, the approaches to resolve the issues of spark and hadoop are discussed elaborately.
DOI10.1109/ICACCS48705.2020.9074222
Citation Keyjaiswal_big_2020