Fuzzy knowledge based performance analysis on big data

Bharill, Neha; Tiwari, Aruna

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/4862

Full metadata record

DC Field	Value	Language
dc.contributor.author	Bharill, Neha	en_US
dc.contributor.author	Tiwari, Aruna	en_US
dc.date.accessioned	2022-03-17T01:00:00Z	-
dc.date.accessioned	2022-03-17T15:35:47Z	-
dc.date.available	2022-03-17T01:00:00Z	-
dc.date.available	2022-03-17T15:35:47Z	-
dc.date.issued	2020	-
dc.identifier.citation	Bharill, N., Tiwari, A., Malviya, A., Patel, O. P., Gupta, A., Puthal, D., . . . Prasad, M. (2020). Fuzzy knowledge based performance analysis on big data. Neurocomputing, 389, 218-228. doi:10.1016/j.neucom.2018.10.088	en_US
dc.identifier.issn	0925-2312	-
dc.identifier.other	EID(2-s2.0-85064842181)	-
dc.identifier.uri	https://doi.org/10.1016/j.neucom.2018.10.088	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/4862	-
dc.description.abstract	Due to the various emerging technologies, an enormous amount of data, termed as Big Data, gets collected every day and can be of great use in various domains. Clustering algorithms that store the entire data into memory for analysis become unfeasible when the dataset is too large. Many clustering algorithms present in the literature deal with the analysis of huge amount of data. The paper discusses a new clustering approach called an Incremental Random Sampling with Iterative Optimization Fuzzy c-Means (IRSIO-FCM) algorithm. It is implemented on Apache Spark, a framework for Big Data processing. Sparks works really well for iterative algorithms by supporting in-memory computations, scalability, etc. IRSIO-FCM not only facilitates effective clustering of Big Data but also performs storage space optimization during clustering. To establish a fair comparison of IRSIO-FCM, we propose an incremental version of the Literal Fuzzy c-Means (LFCM) called ILFCM implemented in Apache Spark framework. The experimental results are analyzed in terms of time and space complexity, NMI, ARI, speedup, sizeup, and scaleup measures. The reported results show that IRSIO-FCM achieves a significant reduction in run-time in comparison with ILFCM. © 2019 Elsevier B.V.	en_US
dc.language.iso	en	en_US
dc.publisher	Elsevier B.V.	en_US
dc.source	Neurocomputing	en_US
dc.subject	Big data	en_US
dc.subject	Cluster analysis	en_US
dc.subject	Digital storage	en_US
dc.subject	Fuzzy systems	en_US
dc.subject	Internet of things	en_US
dc.subject	Iterative methods	en_US
dc.subject	Knowledge based systems	en_US
dc.subject	Large dataset	en_US
dc.subject	Apache spark framework	en_US
dc.subject	Emerging technologies	en_US
dc.subject	Incremental clustering algorithm	en_US
dc.subject	Iterative Optimization	en_US
dc.subject	Parallel processing	en_US
dc.subject	Performance analysis	en_US
dc.subject	Time and space complexity	en_US
dc.subject	Very large datum	en_US
dc.subject	Clustering algorithms	en_US
dc.subject	big data	en_US
dc.subject	controlled study	en_US
dc.subject	internet of things	en_US
dc.subject	memory	en_US
dc.subject	sampling	en_US
dc.subject	storage	en_US
dc.title	Fuzzy knowledge based performance analysis on big data	en_US
dc.type	Journal Article	en_US
Appears in Collections:	Department of Computer Science and Engineering

Files in This Item:

There are no files associated with this item.

Show simple item record

Altmetric Badge: