Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/13063
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBiswas, Aparna Santraen_US
dc.contributor.authorDey, Somnathen_US
dc.contributor.authorAhirwar, Akash Kumaren_US
dc.date.accessioned2024-01-17T10:37:00Z-
dc.date.available2024-01-17T10:37:00Z-
dc.date.issued2024-
dc.identifier.citationBiswas, A. S., Dey, S., & Ahirwar, A. K. (2024). 3sXcsNet: A framework for face presentation attack detection using deep learning. Expert Systems with Applications. Scopus. https://doi.org/10.1016/j.eswa.2023.122821en_US
dc.identifier.issn0957-4174-
dc.identifier.otherEID(2-s2.0-85180406174)-
dc.identifier.urihttps://doi.org/10.1016/j.eswa.2023.122821-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/13063-
dc.description.abstractFace Recognition is a widely used authentication method deployed for accessing personal digital devices like mobiles and laptops, digital door locks, or various security applications. However, presentation attacks in form of print, video and mask can circumvent such authentication methods. Several solutions based on texture features are proposed in the state-of-the-arten_US
dc.description.abstractnevertheless, they are sensitive to varying illumination conditions and are less resilient against 2D and 3D attacks. In this paper, we propose a novel 3-stream compact Xception framework (3sXcsNet) consisting of four input streams RGB, MSRCR, MSRCP, and a depth map to form two specific 3-stream fusion (CRF and CPF) to mitigate the illumination scenarios. These streams provide discriminative features. RGB images have rich texture featuresen_US
dc.description.abstractMSRCR and MSRCP images are illumination invariant and preserve better chromaticity applied on a color or intensity channel, respectively. In addition, an auxiliary depth map that provides depth information for each pixel makes it easier to analyze features and recognize whether the input image is from a real person or a spoof medium. The 3-stream CRF and CPF represent the features differently and feed independently to a deep CNN compact Xception with a CBAM network. These feature maps are fused using an average fusion mechanism for classification of real and spoof faces. We utilize both 2D and 3D attack datasets namely, REPLAY-ATTACK, CASIA-FASD, OULU-NPU, and 3DMAD to evaluate the effectiveness of the proposed framework. We attain an EER of 1.27% and 0.19% for CRF fusion, as well as 1.48% and 0.20% for CPF fusion when assessing performance on the CASIA-FASD and REPLAY-ATTACK datasets, respectively. On the OULU-NPU dataset, we obtain the most favorable results for two protocols while employing CRF fusion, resulting in ACER metrics of 0.7% and 3.2±2.7%, respectively. On the other hand, CPF fusion outperformed with an ACER of 1.8±1.0% for one protocol. Our findings on intra-database outperform the current state-of-the-art. We also perform cross-database testing to assess the generalization capability of the framework on 2D and 3D attack datasets. Furthermore, the proposed method outperforms both fusion procedures in terms of error rate HTER for intra-database and cross-database testing on the 3DMAD dataset. © 2023 Elsevier Ltden_US
dc.language.isoenen_US
dc.publisherElsevier Ltden_US
dc.sourceExpert Systems with Applicationsen_US
dc.subjectCBAMen_US
dc.subjectDeep learningen_US
dc.subjectFace Biometricen_US
dc.subjectMultiscale Retinexen_US
dc.subjectPresentation Attack Detectionen_US
dc.subjectXceptionen_US
dc.title3sXcsNet: A framework for face presentation attack detection using deep learningen_US
dc.typeJournal Articleen_US
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: