Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

TLSPG: Transfer learning-based semi-supervised pseudo-corpus generation approach for zero-shot translation

dc.contributor.author: Kumar, Amit
dc.contributor.author: Mundotiya, Rajesh Kumar
dc.contributor.author: Pratap, Ajay
dc.contributor.author: Singh, Anil Kumar
dc.date.accessioned: 2023-04-19T05:02:04Z
dc.date.available: 2023-04-19T05:02:04Z
dc.date.issued: 2022-10
dc.description: This paper was submitted by an author from IIT (BHU), Varanasi, India [en_US]
dc.description.abstract: Machine Translation (MT) has come a long way in recent years, but it still suffers from data scarcity due to the lack of parallel corpora for low-resource (and sometimes zero-resource) languages. Transfer Learning (TL) is one widely used direction for overcoming this issue in low-resource machine translation systems. Creating parallel corpora for such languages is another way of dealing with data scarcity, but it is a costly, time-consuming and laborious task. To avoid these limitations of parallel corpus creation, we present a TL-based Semi-supervised Pseudo-corpus Generation (TLSPG) approach for zero-shot MT systems. It generates the pseudo-corpus by exploiting the relatedness between low-resource language pairs and zero-resource language pairs via a TL approach. Our experiments empirically confirm that such relatedness helps improve the performance of zero-shot MT systems. Experiments on zero-resource language pairs show that our approach outperforms the existing state-of-the-art models, yielding improvements of +15.56, +8.13, +3.98 and +2 BLEU points for Bhojpuri→Hindi, Magahi→Hindi, Hindi→Bhojpuri and Hindi→Magahi, respectively. [en_US]
dc.description.sponsorship: Science and Engineering Research Board, IIT (BHU), Varanasi, India [en_US]
dc.identifier.issn: 13191578
dc.identifier.uri: https://idr-sdlib.iitbhu.ac.in/handle/123456789/2101
dc.language.iso: en_US [en_US]
dc.publisher: King Saud bin Abdulaziz University [en_US]
dc.relation.ispartofseries: Journal of King Saud University - Computer and Information Sciences; Volume 34, Issue 9, Pages 6552 - 6563
dc.subject: Transfer Learning [en_US]
dc.subject: Zero-shot Translation [en_US]
dc.subject: Semi-supervised [en_US]
dc.subject: Machine Translation [en_US]
dc.title: TLSPG: Transfer learning-based semi-supervised pseudo-corpus generation approach for zero-shot translation [en_US]
dc.type: Article [en_US]

Files

Original bundle

Name: 1-s2.0-S1319157822000921-main.pdf
Size: 1.18 MB
Format: Adobe Portable Document Format
Description: Article - Gold Open Access

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission