Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Intuitionistic fuzzy rough set model based on k-means and its application to enhance prediction of aptamer–protein interacting pairs

dc.contributor.author: Jain P.; Tiwari A.; Som T.
dc.date.accessioned: 2025-05-23T11:13:17Z
dc.description.abstract: Aptamers are oligonucleic acid or peptide molecules that bind to particular target molecules. They play vital roles in various practical applications and physiological functions. Consequently, several diseases can be treated using aptamer-based therapies, and designing the binding of aptamers to specific proteins is essential to advance understanding of aptamer-protein interaction processes. Despite the wide applications of aptamers, identification of aptamer-protein interactions remains inadequate and challenging. Therefore, it is necessary to develop a computational approach that predicts aptamer-protein interactions well. In the present study, a method is proposed for enhancing the prediction of interacting aptamer-target pairs from sequence features of both aptamers and their target proteins, employing a novel k-means-based intuitionistic fuzzy rough feature selection method. Firstly, an intuitionistic fuzzy rough set model based on the k nearest neighbour concept is proposed, and a novel feature selection technique is built on this model. Non-redundant and relevant features are then selected from the training and testing datasets using the proposed feature selection technique. Secondly, SMOTE (Synthetic Minority Oversampling Technique) is applied to obtain optimally balanced training and testing datasets. Thirdly, various machine learning algorithms are applied to the optimally balanced, reduced training and testing datasets to evaluate their performance. Experimental results show that the best prediction performance is obtained by the boosted random forest learning algorithm. Under 10-fold cross-validation, the proposed method achieves, on the optimally balanced, reduced training and testing datasets of aptamer-protein interacting pairs respectively, sensitivity of 91.3% and 86.4%, specificity of 91.9% and 84.8%, overall accuracy of 91.60% and 85.60%, Matthews correlation coefficient of 0.832 and 0.713, AUC (area under the curve) of 0.969 and 0.908, and g-mean of 91.5% and 85.5%. Finally, a comparative study of the best obtained results with the best existing results is presented, which indicates that the proposed approach is the best-performing approach to date. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
dc.identifier.doi: https://doi.org/10.1007/s12652-024-04837-4
dc.identifier.uri: http://172.23.0.11:4000/handle/123456789/5688
dc.relation.ispartofseries: Journal of Ambient Intelligence and Humanized Computing
dc.title: Intuitionistic fuzzy rough set model based on k-means and its application to enhance prediction of aptamer–protein interacting pairs
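
Note on the evaluation measures named in the abstract: sensitivity, specificity, overall accuracy, Matthews correlation coefficient (MCC), and g-mean can all be derived from the four counts of a binary confusion matrix. The following is a minimal sketch using the standard textbook definitions, not the authors' code. The function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Standard binary-classification measures from confusion-matrix counts.

    tp/fn: interacting pairs predicted correctly / missed.
    tn/fp: non-interacting pairs predicted correctly / wrongly flagged.
    """
    sensitivity = tp / (tp + fn)          # true positive rate
    specificity = tn / (tn + fp)          # true negative rate
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    g_mean = math.sqrt(sensitivity * specificity)
    # MCC is conventionally set to 0 when its denominator vanishes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "g_mean": g_mean,
        "mcc": mcc,
    }


# Hypothetical counts, for illustration only (not results from the paper).
example = binary_metrics(tp=90, fn=10, tn=80, fp=20)
for name, value in example.items():
    print(f"{name}: {value:.3f}")
```

The abstract reports sensitivity, specificity, accuracy, and g-mean on a percentage scale; multiplying the corresponding values above by 100 gives the same scale.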
