Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/18284
Title: Automated Parkinson's Disease Detection System Using FBSE-FAWT-Based Time-Frequency Representation of Speech Signals
Authors: Vijay, Amishi
Pachori, Ram Bilas
Appina, Balasubramanyam
Issue Date: 2026
Publisher: Institute of Electrical and Electronics Engineers Inc.
Citation: Vijay, A., Pachori, R. B., Tiwari, N., & Appina, B. (2026). Automated Parkinson’s Disease Detection System Using FBSE-FAWT-Based Time-Frequency Representation of Speech Signals. IEEE Transactions on Human-Machine Systems. https://doi.org/10.1109/THMS.2026.3667216
Abstract: Parkinson's disease (PD) detection from speech signals attracts significant attention from researchers due to the effectiveness of speech impairment symptoms in PD diagnosis in its early phase. The short-time Fourier transform (STFT)-based analysis suffers from fixed resolution. The discrete wavelet transform (DWT) provides multiresolution analysis. However, an analytic wavelet transform with a flexible time-frequency covering (AWTFTFC) is a generalized form of DWT with flexible selection of redundancy, dilation factor, and Q-factor. The Fourier-Bessel series expansion (FBSE)-based flexible analytic wavelet transform (FAWT) (FBSE-FAWT) is an improved version of AWTFTFC with the replacement of the Fourier-based spectrum by the FBSE-based spectrum to have better frequency resolution. In this article, FBSE-FAWT-based time-frequency representation (TFR) is used to detect PD from speech signals. In the FBSE-FAWT-based TFR method, the FBSE-FAWT technique has been used to obtain subband signals and Hilbert spectral analysis is applied to compute TFR of speech signals. Seven different pretrained networks, namely, AlexNet, GoogleNet, SqueezeNet, DenseNet-101, InceptionNet-v3, VGG-16, and ResNet-50 are trained using the obtained TFRs to detect PD. The proposed framework with ResNet-50 demonstrates the highest performance with an accuracy of 98.05%, an F1-score of 98.27%, a specificity of 98.61%, a precision of 98.93%, and a recall of 97.62%. The robustness of the proposed framework is verified in cross-task situations where the framework has been trained on spontaneous dialogue task and tested on read text task. Further, the proposed framework achieves better performance as compared with its variant, where the FBSE-FAWT-based TFR is replaced with a spectrogram, as well as compared with other existing methods in the literature. © 2026 IEEE.
URI: https://dx.doi.org/10.1109/THMS.2026.3667216
https://dspace.iiti.ac.in:8080/jspui/handle/123456789/18284
ISSN: 2168-2291
Type of Material: Journal Article
Appears in Collections:Department of Electrical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: