Variable Length Character N-Gram Embedding of Protein Sequences for Secondary Structure Prediction
| dc.contributor.author | Sharma A.K.; Srivastava R. | |
| dc.date.accessioned | 2025-05-23T11:26:38Z | |
| dc.description.abstract | Background: The prediction of a protein's secondary structure from its amino acid sequence is an essential step towards predicting its 3-D structure. Prediction performance improves when homologous multiple sequence alignment information is incorporated. However, homologous information is not available for all proteins, so it is necessary to predict protein secondary structure from single sequences. Objective and Methods: Protein secondary structure is predicted from primary sequences using n-gram word embedding and a deep recurrent neural network. Protein secondary structure depends on both local and long-range neighboring residues in the primary sequence. In the proposed work, variable-length character n-gram words capture the local contextual information of amino acid residues, and an embedding vector represents each of these n-gram words. A bidirectional long short-term memory (Bi-LSTM) model is then used to capture long-range context by extracting information from past and future residues in the primary sequence. Results: The proposed model is evaluated on three public datasets: ss.txt, RS126, and CASP9. It achieves Q3 accuracies of 92.57%, 86.48%, and 89.66% on ss.txt, RS126, and CASP9, respectively. Conclusion: The performance of the proposed model is compared with state-of-the-art methods available in the literature. The comparative analysis shows that the proposed model performs better than these methods. © 2021 Bentham Science Publishers. | |
| dc.identifier.doi | https://doi.org/10.2174/0929866527666201103145635 | |
| dc.identifier.uri | http://172.23.0.11:4000/handle/123456789/10534 | |
| dc.relation.ispartofseries | Protein and Peptide Letters | |
| dc.title | Variable Length Character N-Gram Embedding of Protein Sequences for Secondary Structure Prediction |
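
The abstract mentions two ideas that a short example may make more concrete: splitting a primary sequence into variable-length character n-grams, and scoring a prediction with Q3 accuracy (the percentage of residues whose three-state label is predicted correctly). The sketch below is illustrative only and is not the authors' implementation. The function names `char_ngrams` and `q3_accuracy`, the n-gram length range, and the toy sequences are all assumptions, and the embedding and Bi-LSTM stages are not shown.

```python
def char_ngrams(sequence, min_n=1, max_n=3):
    """Return every contiguous character n-gram with min_n <= n <= max_n.

    The length range is a hypothetical choice; the abstract does not
    state which n-gram lengths the paper uses.
    """
    grams = []
    for n in range(min_n, max_n + 1):
        for start in range(len(sequence) - n + 1):
            grams.append(sequence[start:start + n])
    return grams


def q3_accuracy(predicted, observed):
    """Percentage of residues whose predicted state matches the observed one.

    Both arguments are equal-length strings over three states,
    here written as H (helix), E (strand), and C (coil).
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    if not observed:
        raise ValueError("sequences must not be empty")
    correct = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * correct / len(observed)


if __name__ == "__main__":
    # Toy inputs, not taken from ss.txt, RS126, or CASP9.
    print(char_ngrams("MKV", min_n=1, max_n=2))
    print(q3_accuracy("HHEC", "HHCC"))
```

In a full pipeline, each n-gram would presumably be mapped to an embedding vector before being passed to the recurrent model; that mapping is left out here because the abstract gives no details about it.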