Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/6154
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJain, Poojaen_US
dc.contributor.authorPachori, Ram Bilasen_US
dc.date.accessioned2022-03-17T01:00:00Z-
dc.date.accessioned2022-03-17T15:46:46Z-
dc.date.available2022-03-17T01:00:00Z-
dc.date.available2022-03-17T15:46:46Z-
dc.date.issued2012-
dc.identifier.citationJain, P., & Pachori, R. B. (2012). Time-order representation based method for epoch detection from speech signals. Journal of Intelligent Systems, 21(1), 79-95. doi:10.1515/jisys-2012-0003en_US
dc.identifier.issn0334-1860-
dc.identifier.otherEID(2-s2.0-84860119809)-
dc.identifier.urihttps://doi.org/10.1515/jisys-2012-0003-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/6154-
dc.description.abstractEpochs present in the voiced speech are defined as time instants of significant excitation of the vocal tract system during the production of speech. Nonstationary nature of excitation source and vocal tract system makes accurate identification of epochs a difficult task. Most of the existing methods for epoch detection require prior knowledge of voiced regions and a rough estimation of pitch frequency. In this paper, we propose a novel method that relies on time-order representation (TOR) based on short-time Fourier- Bessel (FB) series expansion which can be employed on entire speech signal to detect epochs without any prior information. The proposed method automatically detects voiced regions in the speech signal by computing the marginal energy density with respect to time in the low frequency range (LFR) from the energy distribution in the time-frequency plane. An estimate of pitch frequency for each detected voiced region is then obtained by computing the marginal energy density with respect to frequency in the LFR from the energy distribution in the time-frequency plane. Epochs are located for each detected voiced region as peaks in the derivative of the low pass filtered (LPF) signal corresponding to falling edges of peak negative cycles in the LPF signal synthesized from TOR coefficients corresponding to LFR. Experimental results obtained by the proposed method on speech signals taken from the CMU-Arctic database are found to be promising. The proposed method detects epochs with high accuracy and reliability. © de Gruyter 2012.en_US
dc.language.isoenen_US
dc.sourceJournal of Intelligent Systemsen_US
dc.subjectEnergy densityen_US
dc.subjectEnergy distributionsen_US
dc.subjectExcitation sourcesen_US
dc.subjectFalling edgeen_US
dc.subjectFourieren_US
dc.subjectFourier-Bessel series expansionen_US
dc.subjectLow frequency rangeen_US
dc.subjectLow-passen_US
dc.subjectNonstationaryen_US
dc.subjectPitch frequenciesen_US
dc.subjectPrior informationen_US
dc.subjectPrior knowledgeen_US
dc.subjectRough estimationen_US
dc.subjectSeries expansionen_US
dc.subjectSpeech signalsen_US
dc.subjectTime-frequency planesen_US
dc.subjectTime-order representationen_US
dc.subjectVocal-tractsen_US
dc.subjectVoiced speechen_US
dc.subjectElectric power distributionen_US
dc.subjectFourier seriesen_US
dc.subjectLow pass filtersen_US
dc.subjectSignal detectionen_US
dc.subjectSpeech recognitionen_US
dc.titleTime-order representation based method for epoch detection from speech signalsen_US
dc.typeJournal Articleen_US
Appears in Collections:Department of Electrical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: