Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Significance of Corpus Quality for Direct Speech-to-Text Translation Systems

dc.contributor.author: Rajkhowa T.; Chowdhury A.R.; Kumar L.
dc.date.accessioned: 2025-05-23T11:13:44Z
dc.description.abstract: Performance improvements in Direct Speech-to-Text Translation systems are mainly attributed to training on larger corpora, which has led to the development of many large corpora containing data scraped from various online sources. This has resulted in quality issues and impracticality in the long run. Hence, this work investigates the role of corpus quality to determine whether size or quality contributes more to the performance of these systems. Experimental results indicate that a corpus with a richer vocabulary and better translation and audio quality is more effective and contributes more to performance. © 2024 IEEE.
dc.identifier.doi: https://doi.org/10.1109/TENCON61640.2024.10902805
dc.identifier.uri: http://172.23.0.11:4000/handle/123456789/6142
dc.relation.ispartofseries: IEEE Region 10 Annual International Conference, Proceedings/TENCON
dc.title: Significance of Corpus Quality for Direct Speech-to-Text Translation Systems
