3sXcsNet: A framework for face presentation attack detection using deep learning

Biswas, Aparna Santra; Dey, Somnath; Ahirwar, Akash Kumar

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/13063

Full metadata record

DC Field	Value	Language
dc.contributor.author	Biswas, Aparna Santra	en_US
dc.contributor.author	Dey, Somnath	en_US
dc.contributor.author	Ahirwar, Akash Kumar	en_US
dc.date.accessioned	2024-01-17T10:37:00Z	-
dc.date.available	2024-01-17T10:37:00Z	-
dc.date.issued	2024	-
dc.identifier.citation	Biswas, A. S., Dey, S., & Ahirwar, A. K. (2024). 3sXcsNet: A framework for face presentation attack detection using deep learning. Expert Systems with Applications. Scopus. https://doi.org/10.1016/j.eswa.2023.122821	en_US
dc.identifier.issn	0957-4174	-
dc.identifier.other	EID(2-s2.0-85180406174)	-
dc.identifier.uri	https://doi.org/10.1016/j.eswa.2023.122821	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/13063	-
dc.description.abstract	Face Recognition is a widely used authentication method deployed for accessing personal digital devices like mobiles and laptops, digital door locks, or various security applications. However, presentation attacks in form of print, video and mask can circumvent such authentication methods. Several solutions based on texture features are proposed in the state-of-the-art	en_US
dc.description.abstract	nevertheless, they are sensitive to varying illumination conditions and are less resilient against 2D and 3D attacks. In this paper, we propose a novel 3-stream compact Xception framework (3sXcsNet) consisting of four input streams RGB, MSRCR, MSRCP, and a depth map to form two specific 3-stream fusion (CRF and CPF) to mitigate the illumination scenarios. These streams provide discriminative features. RGB images have rich texture features	en_US
dc.description.abstract	MSRCR and MSRCP images are illumination invariant and preserve better chromaticity applied on a color or intensity channel, respectively. In addition, an auxiliary depth map that provides depth information for each pixel makes it easier to analyze features and recognize whether the input image is from a real person or a spoof medium. The 3-stream CRF and CPF represent the features differently and feed independently to a deep CNN compact Xception with a CBAM network. These feature maps are fused using an average fusion mechanism for classification of real and spoof faces. We utilize both 2D and 3D attack datasets namely, REPLAY-ATTACK, CASIA-FASD, OULU-NPU, and 3DMAD to evaluate the effectiveness of the proposed framework. We attain an EER of 1.27% and 0.19% for CRF fusion, as well as 1.48% and 0.20% for CPF fusion when assessing performance on the CASIA-FASD and REPLAY-ATTACK datasets, respectively. On the OULU-NPU dataset, we obtain the most favorable results for two protocols while employing CRF fusion, resulting in ACER metrics of 0.7% and 3.2±2.7%, respectively. On the other hand, CPF fusion outperformed with an ACER of 1.8±1.0% for one protocol. Our findings on intra-database outperform the current state-of-the-art. We also perform cross-database testing to assess the generalization capability of the framework on 2D and 3D attack datasets. Furthermore, the proposed method outperforms both fusion procedures in terms of error rate HTER for intra-database and cross-database testing on the 3DMAD dataset. © 2023 Elsevier Ltd	en_US
dc.language.iso	en	en_US
dc.publisher	Elsevier Ltd	en_US
dc.source	Expert Systems with Applications	en_US
dc.subject	CBAM	en_US
dc.subject	Deep learning	en_US
dc.subject	Face Biometric	en_US
dc.subject	Multiscale Retinex	en_US
dc.subject	Presentation Attack Detection	en_US
dc.subject	Xception	en_US
dc.title	3sXcsNet: A framework for face presentation attack detection using deep learning	en_US
dc.type	Journal Article	en_US
Appears in Collections:	Department of Computer Science and Engineering

Files in This Item:

There are no files associated with this item.

Show simple item record

Altmetric Badge: