iAFPs-Mv-BiTCN: Predicting antifungal peptides using self-attention transformer embedding and transform evolutionary based multi-view features with bidirectional temporal convolutional networks

Cited by: 68
Authors
Akbar, Shahid [1, 2]
Zou, Quan [1, 3]
Raza, Ali [4]
Alarfaj, Fawaz Khaled [5]
Affiliations
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[2] Abdul Wali Khan Univ Mardan, Dept Comp Sci, Kp 23200, Pakistan
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou 324000, Peoples R China
[4] Qurtuba Univ Sci & Informat Technol, Dept Phys & Numer Sci, Peshawar 25124, KP, Pakistan
[5] King Faisal Univ KFU, Sch Business, Dept Management Informat Syst MIS, Al Hasa 31982, Saudi Arabia
Funding
National Natural Science Foundation of China;
Keywords
Antifungal peptides; Word embedding; BERT; Feature selection; Bidirectional temporal convolutional networks; CORROSION TYPE; PROTEIN; CLASSIFICATION; IDENTIFICATION;
DOI
10.1016/j.artmed.2024.102860
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Globally, fungal infections have become a major health concern in humans. Fungal diseases generally occur when an invading fungus establishes itself on a specific part of the body and becomes hard for the human immune system to resist. The recent emergence of COVID-19 has sharply increased the incidence of various nosocomial fungal infections. The existing wet-laboratory-based medications are expensive, time-consuming, and may have adverse side effects on normal cells. In the last decade, peptide therapeutics have gained significant attention due to their high specificity in targeting affected cells without affecting healthy cells. Motivated by the significance of peptide-based therapies, we developed a highly discriminative prediction scheme called iAFPs-Mv-BiTCN to predict antifungal peptides correctly. The training peptides are encoded using word embedding methods such as skip-gram and the attention-mechanism-based bidirectional encoder representations from transformers (BERT). Additionally, transform-based evolutionary features are generated using the pseudo position-specific scoring matrix with discrete wavelet transform (PsePSSM-DWT). The fused vector of word embedding and evolutionary descriptors is formed to compensate for the limitations of single encoding methods. A Shapley Additive exPlanations (SHAP) based global interpolation approach is applied to reduce training costs by choosing the optimal feature set. The selected feature set is trained using a bidirectional temporal convolutional network (BiTCN). The proposed iAFPs-Mv-BiTCN model achieved a predictive accuracy of 98.15% and an AUC of 0.99 on the training samples. On the independent samples, our model obtained an accuracy of 94.11% and an AUC of 0.98. Our iAFPs-Mv-BiTCN model outperformed existing models with approximately 4% and 5% higher accuracy on the training and independent samples, respectively. The reliability and efficacy of the proposed iAFPs-Mv-BiTCN model make it a valuable tool for scientists, and it may play a beneficial role in pharmaceutical design and academic research.
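The abstract describes compressing a PSSM with a discrete wavelet transform and fusing the result with embedding features. A minimal sketch of that idea follows, under several assumptions that are not taken from the record: a Haar wavelet, two decomposition levels, four summary statistics per sub-band, and toy inputs. The function names (`haar_dwt`, `dwt_features`, `fuse`) are hypothetical and this is not the authors' implementation.

```python
import math
from statistics import mean, pstdev


def haar_dwt(signal):
    """Single-level Haar DWT; returns (approximation, detail) coefficient lists."""
    if len(signal) % 2:  # assumption: pad odd-length signals by repeating the last value
        signal = signal + [signal[-1]]
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def dwt_features(pssm, levels=2):
    """Turn a variable-length PSSM (rows = residues) into a fixed-length vector.

    Each column is decomposed `levels` times; every sub-band is summarised by
    mean, standard deviation, min, and max (an illustrative choice).
    """
    features = []
    for column in zip(*pssm):
        approx = list(column)
        for _ in range(levels):
            approx, detail = haar_dwt(approx)
            features += [mean(detail), pstdev(detail), min(detail), max(detail)]
        features += [mean(approx), pstdev(approx), min(approx), max(approx)]
    return features


def fuse(*views):
    """Concatenate several feature views into one multi-view vector."""
    fused = []
    for view in views:
        fused.extend(view)
    return fused


# Toy inputs (hypothetical): a 5-residue PSSM with 3 columns instead of the
# usual 20, and a 4-dimensional stand-in for a learned sequence embedding.
toy_pssm = [[1, -2, 0], [3, 0, -1], [2, 1, 4], [0, -3, 2], [1, 1, 1]]
toy_embedding = [0.1, -0.4, 0.7, 0.2]

evolutionary = dwt_features(toy_pssm)
fused_vector = fuse(toy_embedding, evolutionary)
print(len(evolutionary), len(fused_vector))
```

Because every column yields the same number of statistics regardless of peptide length, the fused vector has a fixed size, which is what allows peptides of different lengths to share one downstream feature-selection and classification stage.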
Pages: 13