Repository logo
Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Semantic relations classification in Hindi compound nouns using embeddings

dc.contributor.authorDwivedi V.; Ghosh S.
dc.date.accessioned2025-05-23T11:17:53Z
dc.description.abstractThe present work outlines the result of Hindi compound noun interpretation using embedding features. The BERT embedding and word2vec embeddings are used to represent the features of compound nouns. The work focuses on classifying the semantic relations between the constituents of Hindi Compound nouns. The Hindi compound noun dataset used for this experiment consists of 1500 compound nouns annotated with four semantic relations. BERT-based classification is based on the model architecture of the BBC Hindi news classification task by BERT. SVM classifier with the word2vec feature is also used for another experiment. We have classified the dataset into four major semantic relations: Purpose, Modifier, Topic, and Others, based on the frequency of the relations in the dataset and then performed a multiclass classification using a BERT classifier. We achieved a significant result with 48% accuracy. We got 47% accuracy in terms of F1 score by using the word2vec model. This is one of the first experiments toward semantic relation classification of Hindi compound nouns using BERT. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
dc.identifier.doihttps://doi.org/10.1007/s41870-023-01374-9
dc.identifier.urihttp://172.23.0.11:4000/handle/123456789/7914
dc.relation.ispartofseriesInternational Journal of Information Technology (Singapore)
dc.titleSemantic relations classification in Hindi compound nouns using embeddings

Files

Collections