Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/11620
Title: RADIANT: Better rPPG estimation using signal embeddings and Transformer
Authors: Gupta, Anup Kumar
Kumar, Rupesh
Birla, Lokendra
Gupta, Puneet
Keywords: Convolutional neural networks;Deep learning;Network architecture;Application: biomedical/healthcare/medicine;Color variations;Convolutional neural network;Heart-rate;Human eye;Noise characteristic;Non-contact;Rate estimation;Signal embedding;Skin colour;Embeddings
Issue Date: 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
Citation: Gupta, A. K., Kumar, R., Birla, L., & Gupta, P. (2023). RADIANT: Better rPPG estimation using signal embeddings and transformer. Paper presented at the Proceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023, 4965-4975. doi:10.1109/WACV56688.2023.00495 Retrieved from www.scopus.com
Abstract: Remote photoplethysmography can provide non-contact heart rate (HR) estimation by analyzing the skin color variations obtained from face videos. These variations are subtle, imperceptible to human eyes, and easily affected by noise. Existing deep learning-based rPPG estimators are incompetent due to three reasons. Firstly, they suppress the noise by utilizing information from the whole face even though different facial regions contain different noise characteristics. Secondly, local noise characteristics inherently affect the convolutional neural network (CNN) architectures. Lastly, the CNN sequential architectures fail to preserve long temporal dependencies. To address these issues, we propose RADIANT, that is, rPPG estimation using Signal Embeddings and Transformer. Our architecture utilizes a multi-head attention mechanism that facilitates feature subspace learning to extract the multiple correlations among the color variations corresponding to the periodic pulse. Also, its global information processing ability helps to suppress local noise characteristics. Furthermore, we propose novel signal embedding to enhance the rPPG feature representation and suppress noise. We have also improved the generalization of our architecture by adding a new training set. To this end, the effectiveness of synthetic temporal signals and data augmentations were explored. Experiments on extensively utilized rPPG datasets demonstrate that our architecture outperforms previous well-known architectures. Code: https://github.com/Deep-Intelligence-Lab/RADIANT.git © 2023 IEEE.
URI: https://doi.org/10.1109/WACV56688.2023.00495
https://dspace.iiti.ac.in/handle/123456789/11620
Type of Material: Conference Paper
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: