 
 
    Please use this identifier to cite or link to this item:
    
    
    https://dspace.iiti.ac.in/handle/123456789/13063
| Title: | 3sXcsNet: A framework for face presentation attack detection using deep learning | 
| Authors: | Biswas, Aparna Santra Dey, Somnath Ahirwar, Akash Kumar | 
| Keywords: | CBAM;Deep learning;Face Biometric;Multiscale Retinex;Presentation Attack Detection;Xception | 
| Issue Date: | 2024 | 
| Publisher: | Elsevier Ltd | 
| Citation: | Biswas, A. S., Dey, S., & Ahirwar, A. K. (2024). 3sXcsNet: A framework for face presentation attack detection using deep learning. Expert Systems with Applications. Scopus. https://doi.org/10.1016/j.eswa.2023.122821 | 
| Abstract: | Face Recognition is a widely used authentication method deployed for accessing personal digital devices like mobiles and laptops, digital door locks, or various security applications. However, presentation attacks in form of print, video and mask can circumvent such authentication methods. Several solutions based on texture features are proposed in the state-of-the-art nevertheless, they are sensitive to varying illumination conditions and are less resilient against 2D and 3D attacks. In this paper, we propose a novel 3-stream compact Xception framework (3sXcsNet) consisting of four input streams RGB, MSRCR, MSRCP, and a depth map to form two specific 3-stream fusion (CRF and CPF) to mitigate the illumination scenarios. These streams provide discriminative features. RGB images have rich texture features MSRCR and MSRCP images are illumination invariant and preserve better chromaticity applied on a color or intensity channel, respectively. In addition, an auxiliary depth map that provides depth information for each pixel makes it easier to analyze features and recognize whether the input image is from a real person or a spoof medium. The 3-stream CRF and CPF represent the features differently and feed independently to a deep CNN compact Xception with a CBAM network. These feature maps are fused using an average fusion mechanism for classification of real and spoof faces. We utilize both 2D and 3D attack datasets namely, REPLAY-ATTACK, CASIA-FASD, OULU-NPU, and 3DMAD to evaluate the effectiveness of the proposed framework. We attain an EER of 1.27% and 0.19% for CRF fusion, as well as 1.48% and 0.20% for CPF fusion when assessing performance on the CASIA-FASD and REPLAY-ATTACK datasets, respectively. On the OULU-NPU dataset, we obtain the most favorable results for two protocols while employing CRF fusion, resulting in ACER metrics of 0.7% and 3.2±2.7%, respectively. On the other hand, CPF fusion outperformed with an ACER of 1.8±1.0% for one protocol. Our findings on intra-database outperform the current state-of-the-art. We also perform cross-database testing to assess the generalization capability of the framework on 2D and 3D attack datasets. Furthermore, the proposed method outperforms both fusion procedures in terms of error rate HTER for intra-database and cross-database testing on the 3DMAD dataset. © 2023 Elsevier Ltd | 
| URI: | https://doi.org/10.1016/j.eswa.2023.122821 https://dspace.iiti.ac.in/handle/123456789/13063 | 
| ISSN: | 0957-4174 | 
| Type of Material: | Journal Article | 
| Appears in Collections: | Department of Computer Science and Engineering | 
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge:
            	
                
    
            
