Scopus Indexed Publications

Paper Details


Title
GRUATT-AVP: leveraging a novel attention-based gated recurrent unit to advance the accuracy of antiviral peptide prediction

Author
, Md Faruk Hosen, S M Hasan Mahmud, Watshara Shoombuatong,

Email

Abstract

Antiviral peptides (AVPs), produced by all living organisms, play a vital role as the first line of immune defense against viral infections. AVPs present a promising path for developing novel antiviral therapies that target diverse viruses, including those resistant to existing drugs. However, identifying AVPs using wet lab methods is often costly and requires significant effort, and existing computational methods still have certain limitations. In this study, a novel attention-based Gated Recurrent Unit framework, named GRUATT-AVP, is proposed for accurate and fast AVPs identification. In GRUATT-AVP, several Natural Language Processing (NLP) based encoding mechanisms, including One-Hot Encoding, Word2Vec, GloVe, FastText, and ProtBert, are adopted to encode the peptide sequences. Sequentially, different embedding dimensions based on the k-mer with fixed lengths (1-6) and pooling were explored, aiming to capture the local context within the sequences. After that, we conducted another experiment to determine the best feature selection technique and integrated the SHAP technique to eliminate noise and less important encoded features, thereby improving the model's generalization performance. Finally, the most informative subset was fed into our developed GRUATT-AVP model to construct the GRUATT-AVP for classification. To understand the contribution of each component in the GRUATT-AVP model, an ablation study was performed, and the outcomes showed that our proposed model outperforms its other variants, establishing the model's stability and efficacy. In terms of AVP prediction results, GRUATT-AVP demonstrated better performance compared to several state-of-the-art classifiers, with an accuracy of 94.8% and an AUC of 0.986, suggesting promising therapeutic potential against viral infections. To ensure wide accessibility and practical usage, the GRUATT-AVP web server is available at https://gruatt-avp.vercel.app/ .


Keywords
Antiviral peptide; Attention mechanism; Feature extraction; Gated recurrent unit; Shapley values; Word embedding.

Journal or Conference Name
Scientific Reports

Publication Year
2025

Indexing
scopus