Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/6111
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJain, Poojaen_US
dc.contributor.authorPachori, Ram Bilasen_US
dc.date.accessioned2022-03-17T01:00:00Z-
dc.date.accessioned2022-03-17T15:46:23Z-
dc.date.available2022-03-17T01:00:00Z-
dc.date.available2022-03-17T15:46:23Z-
dc.date.issued2014-
dc.identifier.citationJain, P., & Pachori, R. B. (2014). Event-based method for instantaneous fundamental frequency estimation from voiced speech based on eigenvalue decomposition of the hankel matrix. IEEE Transactions on Audio, Speech and Language Processing, 22(10), 1467-1482. doi:10.1109/TASLP.2014.2335056en_US
dc.identifier.issn1558-7916-
dc.identifier.otherEID(2-s2.0-84911369306)-
dc.identifier.urihttps://doi.org/10.1109/TASLP.2014.2335056-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/6111-
dc.description.abstractWe propose a robust event-based method for estimation of the instantaneous fundamental frequency of a voiced speech signal. The amplitude and frequency modulated (AM-FM) signal model of voiced speech in the low frequency range (LFR) indicates the presence of energy only around its instantaneous fundamental frequency (F0) and its few harmonics. The time-varying F0 component of a voiced speech signal is extracted by a robust algorithm which iteratively performs eigenvalue decomposition (EVD) of the Hankel matrix, initially constructed from samples of the LFR filtered voiced speech signal. The negative cycles of the extracted time-varying F0 component provide a reliable coarse estimate of intervals where glottal closure instants (GCIs) may be present. The negative cycles of the LFR filtered voiced speech signal occurring within these intervals are isolated. There is a sudden decrease in the glottal impedance at GCIs resulting in high signal strength. Therefore, GCIs are detected as local minima in the derivative of the falling edges of the isolated negative cycles of the LFR filtered voiced speech signal, followed by a selection criterion to discard false GCI candidates. The instantaneous F0 is estimated as the inverse of the time interval between two consecutive GCIs. Experiments were performed on the Keele and CSTR speech databases in white and babble noise environments at various levels of degradation to assess the performance of the proposed method. The proposed method substantially reduces the gross F0 estimation errors in comparison to some state of the art methods. © 2014 IEEE.en_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.sourceIEEE Transactions on Audio, Speech and Language Processingen_US
dc.subjectEigenvalues and eigenfunctionsen_US
dc.subjectFrequency estimationen_US
dc.subjectIterative methodsen_US
dc.subjectMatrix algebraen_US
dc.subjectNatural frequenciesen_US
dc.subjectSignal processingen_US
dc.subjectSpeechen_US
dc.subjectSpeech analysisen_US
dc.subjectSpeech communicationen_US
dc.subjectEigenvalue decompositionen_US
dc.subjectFrequency modulateden_US
dc.subjectFundamental frequenciesen_US
dc.subjectFundamental frequency estimationen_US
dc.subjectHankel matrixen_US
dc.subjectLow frequency rangeen_US
dc.subjectSpeech signal processingen_US
dc.subjectState-of-the-art methodsen_US
dc.subjectSpeech recognitionen_US
dc.titleEvent-based method for instantaneous fundamental frequency estimation from voiced speech based on eigenvalue decomposition of the Hankel matrixen_US
dc.typeJournal Articleen_US
Appears in Collections:Department of Electrical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: