A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem

Sunil Kumar; Maninder Singh

doi:10.26599/BDMA.2018.9020037

Open Access

Sunil Kumar, Maninder Singh

∙ Directorate of Livestock Farms, Guru Angad Dev Veterinary and Animal Sciences University, Ludhiana 141001, India.

∙ Department of Computer Science, Punjabi University, Punjab 147002, India.

Abstract

Big data analytics and data mining are techniques used to analyze data and extract hidden information. Traditional approaches to analysis and extraction do not work well for big data because such data is complex and of very high volume. Data clustering, a major data mining technique, groups the data into clusters and makes it easier to extract information from them. However, existing clustering algorithms, such as $k$-means and hierarchical clustering, are not efficient because the quality of the clusters they produce is compromised. Therefore, there is a need to design an efficient and highly scalable clustering algorithm. In this paper, we put forward a new clustering algorithm, called hybrid clustering, to overcome the disadvantages of existing clustering algorithms. We compare the new hybrid algorithm with existing algorithms on the basis of precision, recall, F-measure, execution time, and accuracy of results. The experimental results show that the proposed hybrid clustering algorithm is more accurate than the existing algorithms and has better precision, recall, and F-measure values.
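The abstract names precision, recall, and F-measure as comparison criteria but does not state how they are computed for clusters. As a minimal sketch, assuming the common pairwise convention (a pair of points counts as a true positive when it shares a cluster in both the reference labeling and the predicted clustering), the three values could be computed as follows. The function name `pairwise_scores` and the sample labels are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def pairwise_scores(true_labels, pred_labels):
    """Pairwise precision, recall, and F-measure for a clustering.

    Assumption: pair-counting definitions; the paper may define
    these measures differently.
    """
    if len(true_labels) != len(pred_labels):
        raise ValueError("label sequences must have the same length")

    tp = fp = fn = 0
    for i, j in combinations(range(len(true_labels)), 2):
        same_true = true_labels[i] == true_labels[j]
        same_pred = pred_labels[i] == pred_labels[j]
        if same_pred and same_true:
            tp += 1  # pair grouped together in both labelings
        elif same_pred:
            fp += 1  # grouped together only by the clustering
        elif same_true:
            fn += 1  # grouped together only in the reference

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


# Hypothetical labels: six points, one of them placed in the wrong cluster.
reference = [0, 0, 0, 1, 1, 1]
predicted = [0, 0, 1, 1, 1, 1]
precision, recall, f_measure = pairwise_scores(reference, predicted)
print(precision, recall, f_measure)
```

The loop visits every pair of points, so it is quadratic in the number of points; it is meant only to illustrate the definitions on small samples, not to scale to the data volumes the paper targets.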

Keywords

clustering; Hadoop; big data; k-means; hierarchical

Big Data Mining and Analytics

Volume 2, Issue 4, December 2019

Pages 240-247

DOI: 10.26599/BDMA.2018.9020037

Cite this article:

Kumar S, Singh M. A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem. Big Data Mining and Analytics, 2019, 2(4): 240-247. https://doi.org/10.26599/BDMA.2018.9020037

Received: 08 November 2018

Revised: 09 January 2019

Accepted: 12 January 2019

Published: 05 August 2019

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).