Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation

Sethiya, Nivedita; Maurya, Chandresh Kumar

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/14221

Full metadata record

DC Field	Value	Language
dc.contributor.author	Sethiya, Nivedita	en_US
dc.contributor.author	Maurya, Chandresh Kumar	en_US
dc.date.accessioned	2024-08-14T10:23:44Z	-
dc.date.available	2024-08-14T10:23:44Z	-
dc.date.issued	2024	-
dc.identifier.citation	Sethiya, N., Nair, S., & Maurya, C. K. (2024). Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195993950&partnerID=40&md5=022be970a18adaff6bdbf697c0a3f886	en_US
dc.identifier.isbn	978-2493814104	-
dc.identifier.other	EID(2-s2.0-85195993950)	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/14221	-
dc.description.abstract	Speech-to-text translation (ST) is the task of translating speech in one language into text in a different language. It has use cases in subtitling, dubbing, etc. Traditionally, ST tasks have been solved by cascading automatic speech recognition (ASR) and machine translation (MT) models, which leads to error propagation, high latency, and long training times. To minimize such issues, end-to-end models have been proposed recently. However, we find that only a few works have reported results of ST models on a limited number of low-resource languages. To take a step further in this direction, we release datasets and baselines for low-resource ST tasks. Concretely, our dataset has 9 language pairs, and benchmarking has been done against SOTA ST models. The low performance of SOTA ST models on Indic-TEDST data indicates the need to develop ST models specifically designed for low-resource languages. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.	en_US
dc.language.iso	en	en_US
dc.publisher	European Language Resources Association (ELRA)	en_US
dc.source	2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings	en_US
dc.subject	Automatic Speech Recognition	en_US
dc.subject	Indic Languages	en_US
dc.subject	Low-Resource Languages	en_US
dc.subject	Machine Translation	en_US
dc.subject	Speech-to-text Translation	en_US
dc.subject	Video Subtitling	en_US
dc.title	Indic-TEDST: Datasets and Baselines for Low-Resource Speech to Text Translation	en_US
dc.type	Conference Paper	en_US
Appears in Collections:	Department of Computer Science and Engineering

