 
 
Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/4839
| Title: | Anomaly Detection in Resource Constrained Environments With Streaming Data | 
| Authors: | Jain, Prarthi; Jain, Seemandhar; Shrivastava, Abhishek | 
| Keywords: | Data handling;Digital storage;Forestry;Dynamic Streaming;Empirical evaluations;Hardware implementations;High-dimensional;Receiver operating characteristic curves;Sliding window mechanism;State of the art;Storage spaces;Anomaly detection | 
| Issue Date: | 2021 | 
| Publisher: | Institute of Electrical and Electronics Engineers Inc. | 
| Citation: | Jain, P., Jain, S., Zaiane, O. R., & Srivastava, A. (2021). Anomaly detection in resource constrained environments with streaming data. IEEE Transactions on Emerging Topics in Computational Intelligence. doi:10.1109/TETCI.2021.3070660 | 
| Abstract: | Isolation Forest (iForest) is a well-known technique for anomaly detection. It is, however, a bulky approach that assumes ample storage space, and it is ineffective on the dynamic streaming data now common across application domains. In this work, we present the Preprocessed Isolation Forest (PiForest) approach for anomaly detection, which works well in resource-constrained environments and is also effective on streaming data. PiForest is largely based on the iForest algorithm and adds a pre-processing stage to handle streaming data effectively. In the pre-processing stage, Principal Component Analysis (PCA) is first used to significantly reduce the dimension and bulk of the data. The streaming characteristic of the data is then handled through a sliding window mechanism that creates sequential blocks of data for systematic processing. PiForest identifies anomalies as effectively as iForest and other state-of-the-art anomaly detection techniques but has substantially lower storage and prediction complexity. We conduct an empirical evaluation of the proposed approach on standard data sets and show that it performs comparably with standard techniques in terms of Area Under the Receiver Operating Characteristic Curve (AUC-ROC) and is able to work with high-dimensional, streaming data. Finally, we carry out a real-world hardware implementation of PiForest and demonstrate that the approach is realistic and practicable in resource-constrained environments. | 
| URI: | https://doi.org/10.1109/TETCI.2021.3070660; https://dspace.iiti.ac.in/handle/123456789/4839 | 
| ISSN: | 2471-285X | 
| Type of Material: | Journal Article | 
| Appears in Collections: | Department of Computer Science and Engineering | 
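
The pipeline outlined in the abstract (PCA reduction, sequential blocks from a sliding window, isolation-based scoring) can be illustrated with a minimal sketch. This is not the authors' PiForest implementation. Every function name, the non-overlapping block policy, the per-block PCA fit, the single retained component, and all parameter values below are assumptions made for illustration only.

```python
import math
import random

def top_component(rows, iters=50):
    """Estimate the first principal direction by power iteration (assumed PCA stand-in)."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - mean[j] for j in range(d)] for r in rows]
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(d)] for i in range(d)]
    v = [1.0] * d  # start vector; assumed not orthogonal to the top direction
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w)) or 1.0
        v = [x / norm for x in w]
    return mean, v

def project(rows, mean, v):
    """Reduce each row to one coordinate along the estimated direction."""
    return [[sum((r[j] - mean[j]) * v[j] for j in range(len(v)))] for r in rows]

def sliding_blocks(stream, size):
    """Yield sequential, non-overlapping blocks; a trailing partial block is dropped."""
    block = []
    for row in stream:
        block.append(row)
        if len(block) == size:
            yield block
            block = []

def c(n):
    """Average path length of an unsuccessful binary-search-tree lookup over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    return 2.0 * (math.log(n - 1) + 0.5772156649) - 2.0 * (n - 1) / n

def build_tree(points, rng, depth, max_depth):
    """Grow one isolation tree with uniformly random axis-aligned splits."""
    if len(points) <= 1 or depth >= max_depth:
        return {"size": len(points)}
    dim = rng.randrange(len(points[0]))
    lo = min(p[dim] for p in points)
    hi = max(p[dim] for p in points)
    if lo == hi:
        return {"size": len(points)}
    split = rng.uniform(lo, hi)
    left = [p for p in points if p[dim] < split]
    right = [p for p in points if p[dim] >= split]
    return {"dim": dim, "split": split,
            "left": build_tree(left, rng, depth + 1, max_depth),
            "right": build_tree(right, rng, depth + 1, max_depth)}

def path_length(tree, p, depth=0):
    """Depth at which p lands in a leaf, plus the expected depth of the unsplit remainder."""
    if "split" not in tree:
        return depth + c(tree["size"])
    child = tree["left"] if p[tree["dim"]] < tree["split"] else tree["right"]
    return path_length(child, p, depth + 1)

def score_window(points, n_trees=100, seed=0):
    """Isolation-style anomaly score in (0, 1]; higher means more anomalous."""
    rng = random.Random(seed)
    max_depth = math.ceil(math.log2(max(len(points), 2)))
    trees = [build_tree(points, rng, 0, max_depth) for _ in range(n_trees)]
    norm = c(len(points)) or 1.0
    return [2.0 ** (-sum(path_length(t, p) for t in trees) / (n_trees * norm))
            for p in points]

def detect(stream, size=32):
    """Score each block of the stream after reducing it to its first principal component."""
    for block in sliding_blocks(stream, size):
        mean, v = top_component(block)
        yield score_window(project(block, mean, v))
```

Calling `detect(stream, size=32)` on an iterable of equal-length numeric rows yields one list of scores per block, so only a single block and its trees are held in memory at a time. Short average path lengths map to scores near 1, which is the usual isolation-forest convention for flagging likely anomalies.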
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
