Significance of Corpus Quality for Direct Speech-to-Text Translation Systems
| dc.contributor.author | Rajkhowa T.; Chowdhury A.R.; Kumar L. | |
| dc.date.accessioned | 2025-05-23T11:13:44Z | |
| dc.description.abstract | Performance improvement in Direct Speech-to- Text Translation systems are mainly attributed to its training using a larger corpora which led to the development of many larger corpora containing data scrapped from various online sources, This resulted in quality issues and impracticality in the long run. Hence, this work investigates the role of quality in the corpora to determine whether size or quality has more contribution in the performance of these systems. Experimental results indicate that a corpus containing a richer vocabulary with better translation and audio quality is more effective and has a greater contribution in the performance. © 2024 IEEE. | |
| dc.identifier.doi | https://doi.org/10.1109/TENCON61640.2024.10902805 | |
| dc.identifier.uri | http://172.23.0.11:4000/handle/123456789/6142 | |
| dc.relation.ispartofseries | IEEE Region 10 Annual International Conference, Proceedings/TENCON | |
| dc.title | Significance of Corpus Quality for Direct Speech-to-Text Translation Systems |