Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/4862
Title: | Fuzzy knowledge based performance analysis on big data |
Authors: | Bharill, Neha Tiwari, Aruna |
Keywords: | Big data;Cluster analysis;Digital storage;Fuzzy systems;Internet of things;Iterative methods;Knowledge based systems;Large dataset;Apache spark framework;Emerging technologies;Incremental clustering algorithm;Iterative Optimization;Parallel processing;Performance analysis;Time and space complexity;Very large datum;Clustering algorithms;article;big data;controlled study;internet of things;memory;sampling;storage |
Issue Date: | 2020 |
Publisher: | Elsevier B.V. |
Citation: | Bharill, N., Tiwari, A., Malviya, A., Patel, O. P., Gupta, A., Puthal, D., . . . Prasad, M. (2020). Fuzzy knowledge based performance analysis on big data. Neurocomputing, 389, 218-228. doi:10.1016/j.neucom.2018.10.088 |
Abstract: | Due to the various emerging technologies, an enormous amount of data, termed as Big Data, gets collected every day and can be of great use in various domains. Clustering algorithms that store the entire data into memory for analysis become unfeasible when the dataset is too large. Many clustering algorithms present in the literature deal with the analysis of huge amount of data. The paper discusses a new clustering approach called an Incremental Random Sampling with Iterative Optimization Fuzzy c-Means (IRSIO-FCM) algorithm. It is implemented on Apache Spark, a framework for Big Data processing. Sparks works really well for iterative algorithms by supporting in-memory computations, scalability, etc. IRSIO-FCM not only facilitates effective clustering of Big Data but also performs storage space optimization during clustering. To establish a fair comparison of IRSIO-FCM, we propose an incremental version of the Literal Fuzzy c-Means (LFCM) called ILFCM implemented in Apache Spark framework. The experimental results are analyzed in terms of time and space complexity, NMI, ARI, speedup, sizeup, and scaleup measures. The reported results show that IRSIO-FCM achieves a significant reduction in run-time in comparison with ILFCM. © 2019 Elsevier B.V. |
URI: | https://doi.org/10.1016/j.neucom.2018.10.088 https://dspace.iiti.ac.in/handle/123456789/4862 |
ISSN: | 0925-2312 |
Type of Material: | Journal Article |
Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: