SMOPredT4SE: An Effective Prediction of Bacterial Type IV Secreted Effectors Using SVM Training With SMO

被引:5
作者
Yan, Zihao [1 ]
Chen, Dong [2 ]
Teng, Zhixia [1 ]
Wang, Donghua [3 ]
Li, Yanjuan [1 ]
机构
[1] Northeast Forestry Univ, Sch Informat & Comp Engn, Harbin 150040, Peoples R China
[2] Heilongjiang Inst Technol, Sch Comp Sci & Technol, Harbin 150050, Peoples R China
[3] Heilongjiang Prov Land Reclamat Headquarters Gen, Dept Gen Surg, Harbin 150088, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine learning; protein classification; sequential minimal optimization; type IV secreted effector; MACHINE-LEARNING-METHODS; AMINO-ACID-COMPOSITION; NEURAL P SYSTEMS; FEATURE-SELECTION; IDENTIFICATION; DRUG; TRANSLOCATION; VALIDATION; LEGIONELLA; PROTEINS;
D O I
10.1109/ACCESS.2020.2971091
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Various bacterial pathogens can deliver their secreted effectors to host cells via type IV secretion system (T4SS) and cause host diseases. Since T4SS secreted effectors (T4SEs) play important roles in the interaction between pathogens and host, identifying T4SEs is crucial to understanding of the pathogenic mechanism of T4SS. We established an effective predictor called SMOPredT4SE to identify T4SEs from protein sequences. SMOPredT4SE employed combination features of series correlation pseudo amino acid composition and position-specific scoring matrix to present protein sequences, and employed support vector machines (SVM) training with sequential minimal optimization (SMO) arithmetic to train the prediction model (To distinguish it from the traditional SVM, we will abbreviate it as SMO later). In the 5-fold cross-validation test, SMOPredT4SE's overall accuracy was 95.6%. Experiments on comparison with other feature, classifiers, and existing methods are conducted. Experimental results show the effectiveness of SMOPredT4SE in predicting T4SEs.
引用
收藏
页码:25570 / 25578
页数:9
相关论文
共 107 条
  • [1] SDBP-Pred: Prediction of single-stranded and double-stranded DNA-binding proteins by extending consensus sequence and K-segmentation strategies into PSSM
    Ali, Farman
    Arif, Muhammad
    Khan, Zaheer Ullah
    Kabir, Muhammad
    Ahmed, Saeed
    Yu, Dong-Jun
    [J]. ANALYTICAL BIOCHEMISTRY, 2020, 589
  • [2] Comprehensive assessment and performance improvement of effector protein predictors for bacterial secretion systems III, IV and VI
    An, Yi
    Wang, Jiawei
    Li, Chen
    Leier, Andre
    Marquez-Lago, Tatiana
    Wilksch, Jonathan
    Zhang, Yang
    Webb, Geoffrey I.
    Song, Jiangning
    Lithgow, Trevor
    [J]. BRIEFINGS IN BIOINFORMATICS, 2018, 19 (01) : 148 - 161
  • [3] [Anonymous], [No title captured]
  • [4] [Anonymous], [No title captured]
  • [5] [Anonymous], [No title captured]
  • [6] [Anonymous], [No title captured]
  • [7] Awais M., IEEE ACM T COMPUT BI
  • [8] Type IV secretion systems and their effectors in bacterial pathogenesis
    Backert, S
    Meyer, TF
    [J]. CURRENT OPINION IN MICROBIOLOGY, 2006, 9 (02) : 207 - 217
  • [9] BUCKLIN SE, 1994, J ENDOTOXIN RES, V1, P45
  • [10] Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach
    Burstein, David
    Zusman, Tal
    Degtyar, Elena
    Viner, Ram
    Segal, Gil
    Pupko, Tal
    [J]. PLOS PATHOGENS, 2009, 5 (07)