COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PREDICTING NLR PROTEINS

被引:1
作者
Nadia [1 ]
Gandotra, Ekta [2 ]
Kumar, Narendra [1 ]
机构
[1] Jaypee Univ Informat Technol, Dept Comp Sci & Engn, Solan 173234, HP, India
[2] Jaypee Univ Informat Technol, Dept Comp Sci & Engn & Informat Technol, Solan 173234, HP, India
来源
BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS | 2023年 / 35卷 / 02期
关键词
NLR; Machine learning; SVM; SMO; Random forest; Cross-validation;
D O I
10.4015/S1016237222500508
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
The nucleotide-binding domain leucine-rich repeat-containing (NLR) proteins plays significant role in the intestinal tissue repair and innate immunity. It recently added to the members of innate immunity effectors molecules. It also plays an essential role in intestinal microbiota and recently emerged as a crucial hit for developing ulcerative colitis (UC) and colitis-associated cancer (CAC). A machine learning-based approach for predicting NLR proteins has been developed. In this study, we present a comparison of three supervised machine learning algorithms. Using ProtR and POSSUM Packages, the features are extracted for the dataset used in this work. The models are trained with the input compositional features generated using dipeptide composition, amino acid composition, etc., as well as Position Specific Scoring Matrix (PSSM) based compositions. The dataset consists of 390 proteins for the negative and positive datasets. The five-fold cross-validation (CV) is used to optimize Sequential Minimal Optimization (SMO) library of Support Vector Machine (LIB-SVM) and Random Forest (RF) parameters, and the best model was selected. The proposed work performs rationally well with an accuracy of 90.91% and 93.94% for RF as the best classifier for the Amino Acid Composition (AAC) and PSE_PSSM-based model. We believe that this method is a reliable, rapid and useful prediction method for NLR Protein.
引用
收藏
页数:15
相关论文
共 36 条
[1]   An N-terminal motif in NLR immune receptors is functionally conserved across distantly related plant species [J].
Adachi, Hiroaki ;
Contreras, Mauricio P. ;
Harant, Adeline ;
Wu, Chih-hang ;
Derevnina, Lida ;
Sakai, Toshiyuki ;
Duggan, Cian ;
Moratto, Eleonora ;
Bozkurt, Tolga O. ;
Maqbool, Abbas ;
Win, Joe ;
Kamoun, Sophien .
ELIFE, 2019, 8
[2]   Machine learning can identify newly diagnosed patients with CLL at high risk of infection [J].
Agius, Rudi ;
Brieghel, Christian ;
Andersen, Michael A. ;
Pearson, Alexander T. ;
Ledergerber, Bruno ;
Cozzi-Lepri, Alessandro ;
Louzoun, Yoram ;
Andersen, Christen L. ;
Bergstedt, Jacob ;
von Stemann, Jakob H. ;
Jorgensen, Mette ;
Tang, Man-Hung Eric ;
Fontes, Magnus ;
Bahlo, Jasmin ;
Herling, Carmen D. ;
Hallek, Michael ;
Lundgren, Jens ;
MacPherson, Cameron Ross ;
Larsen, Jan ;
Niemann, Carsten U. .
NATURE COMMUNICATIONS, 2020, 11 (01)
[3]   A Machine Learning Based Intrusion Detection System for Mobile Internet of Things [J].
Amouri, Amar ;
Alaparthy, Vishwa T. ;
Morgera, Salvatore D. .
SENSORS, 2020, 20 (02)
[4]   NLR diversity, helpers and integrated domains: making sense of the NLR IDentity [J].
Baggs, E. ;
Dagdas, G. ;
Krasileva, K. V. .
CURRENT OPINION IN PLANT BIOLOGY, 2017, 38 :59-67
[5]   Regulation of intestinal microbiota by the NLR protein family [J].
Biswas, Amlan ;
Kobayashi, Koichi S. .
INTERNATIONAL IMMUNOLOGY, 2013, 25 (04) :207-214
[6]   LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion [J].
Chen, Cheng ;
Zhang, Qingmei ;
Ma, Qin ;
Yu, Bin .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 191 :54-64
[7]   Role of Nlrp6 and Nlrp12 in the maintenance of intestinal homeostasis [J].
Chen, Grace Y. .
EUROPEAN JOURNAL OF IMMUNOLOGY, 2014, 44 (02) :321-327
[8]  
Fletcher RR, 2019, IEEE ENG MED BIO, P2234, DOI [10.1109/EMBC.2019.8857942, 10.1109/embc.2019.8857942]
[9]  
Hartmann S, 2019, IEEE ENG MED BIO, P1842, DOI [10.1109/EMBC.2019.8857006, 10.1109/embc.2019.8857006]
[10]  
Higashi K, 2019, IEEE ENG MED BIO, P788, DOI [10.1109/EMBC.2019.8857830, 10.1109/embc.2019.8857830]