Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/15137
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSethiya, Niveditaen_US
dc.contributor.authorMaurya, Chandresh Kumaren_US
dc.date.accessioned2024-12-24T05:20:06Z-
dc.date.available2024-12-24T05:20:06Z-
dc.date.issued2025-
dc.identifier.citationSethiya, N., & Maurya, C. K. (2025). End-to-End Speech-to-Text Translation: A Survey. Computer Speech and Language. Scopus. https://doi.org/10.1016/j.csl.2024.101751en_US
dc.identifier.issn0885-2308-
dc.identifier.otherEID(2-s2.0-85209380627)-
dc.identifier.urihttps://doi.org/10.1016/j.csl.2024.101751-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/15137-
dc.description.abstractSpeech-to-Text (ST) translation pertains to the task of converting speech signals in one language to text in another language. It finds its application in various domains, such as hands-free communication, dictation, video lecture transcription, and translation, to name a few. Automatic Speech Recognition (ASR), as well as Machine Translation(MT) models, play crucial roles in traditional ST translation, enabling the conversion of spoken language in its original form to written text and facilitating seamless cross-lingual communication. ASR recognizes spoken words, while MT translates the transcribed text into the target language. Such integrated models suffer from cascaded error propagation and high resource and training costs. As a result, researchers have been exploring end-to-end (E2E) models for ST translation. However, to our knowledge, there is no comprehensive review of existing works on E2E ST. The present survey, therefore, discusses the works in this direction. We have attempted to provide a comprehensive review of models employed, metrics, and datasets used for ST tasks, providing challenges and future research direction with new insights. We believe this review will be helpful to researchers working on various applications of ST models. © 2024 Elsevier Ltden_US
dc.language.isoenen_US
dc.publisherAcademic Pressen_US
dc.sourceComputer Speech and Languageen_US
dc.subjectAutomatic speech recognitionen_US
dc.subjectMachine translationen_US
dc.subjectModality bridgingen_US
dc.subjectSpeech-to-text translationen_US
dc.titleEnd-to-End Speech-to-Text Translation: A Surveyen_US
dc.typeReviewen_US
Appears in Collections:Department of Computer Science and Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: