LipoSVM: Prediction of Lysine Lipoylation in Proteins based on the Support Vector Machine

被引:4
作者
Wu, Meiqi [1 ]
Lu, Pengchao [2 ]
Yang, Yingxi [3 ]
Liu, Liwen [1 ]
Wang, Hui [4 ]
Xu, Yan [1 ]
Chu, Jixun [1 ]
机构
[1] Univ Sci & Technol Beijing, Dept Appl Math, Beijing 100083, Peoples R China
[2] China Petr Pipeline Engn Co Ltd, Equipment Leasing Co, Langfang City 065000, Hebei, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Chem & Biol Engn, Hong Kong, Peoples R China
[4] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
关键词
Lysine lipoylation; prediction; amino acids; support vector machine; post-translational modifications; scoring matrix; PYRUVATE-DEHYDROGENASE COMPLEX; LIPOIC ACID; ACETYLATION; CANCER;
D O I
10.2174/1389202919666191014092843
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Lysine lipoylation which is a rare and highly conserved post-translational modification of proteins has been considered as one of the most important processes in the biological field. To obtain a comprehensive understanding of regulatory mechanism of lysine lipoylation, the key is to identify lysine lipoylated sites. The experimental methods are expensive and laborious. Due to the high cost and complexity of experimental methods, it is urgent to develop computational ways to predict lipoylation sites. Methodology: In this work, a predictor named LipoSVM is developed to accurately predict lipoylation sites. To overcome the problem of an unbalanced sample, synthetic minority over-sampling technique (SMOTE) is utilized to balance negative and positive samples. Furthermore, different ratios of positive and negative samples are chosen as training sets. Results: By comparing five different encoding schemes and five classification algorithms, LipoSVM is constructed finally by using a training set with positive and negative sample ratio of 1:1, combining with position-specific scoring matrix and support vector machine. The best performance achieves an accuracy of 99.98% and AUC 0.9996 in 10-fold cross-validation. The AUC of independent test set reaches 0.9997, which demonstrates the robustness of LipoSVM. The analysis between lysine lipoylation and non-lipoylation fragments shows significant statistical differences. Conclusion: A good predictor for lysine lipoylation is built based on position-specific scoring matrix and support vector machine. Meanwhile, an online webserver LipoSVM can be freely downloaded from https://github.com/stars20180811/LipoSVM.
引用
收藏
页码:362 / 370
页数:9
相关论文
共 40 条
  • [1] ACETYLATION + METHYLATION OF HISTONES + THEIR POSSIBLE ROLE IN REGULATION OF RNA SYNTHESIS
    ALLFREY, VG
    FAULKNER, R
    MIRSKY, AE
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1964, 51 (05) : 786 - +
  • [2] EPSILON-N-METHYL-LYSINE IN BACTERIAL FLAGELLAR PROTEIN
    AMBLER, RP
    REES, MW
    [J]. NATURE, 1959, 184 (4679) : 56 - 57
  • [3] Azevedo Cristina, 2015, Advances in Biological Regulation, V60, P144, DOI 10.1016/j.jbior.2015.09.008
  • [4] ALPHA-LIPOIC ACID IS AN EFFECTIVE INHIBITOR OF HUMAN-IMMUNODEFICIENCY-VIRUS (HIV-1) REPLICATION
    BAUR, A
    HARRER, T
    PEUKERT, M
    JAHN, G
    KALDEN, JR
    FLECKENSTEIN, B
    [J]. KLINISCHE WOCHENSCHRIFT, 1991, 69 (15): : 722 - 724
  • [5] SMOTE for high-dimensional class-imbalanced data
    Blagus, Rok
    Lusa, Lara
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [6] Mapping the lipoylation site of Arabidopsis thaliana plastidial dihydrolipoamide S-acetyltransferase using mass spectrometry and site-directed mutagenesis
    Casteel, Jill
    Miernyk, Jan A.
    Thelen, Jay J.
    [J]. PLANT PHYSIOLOGY AND BIOCHEMISTRY, 2011, 49 (11) : 1355 - 1361
  • [7] Dysregulation of glucose transport, glycolysis, TCA cycle and glutaminolysis by oncogenes and tumor suppressors in cancer cells
    Chen, Jin-Qiang
    Russo, Jose
    [J]. BIOCHIMICA ET BIOPHYSICA ACTA-REVIEWS ON CANCER, 2012, 1826 (02): : 370 - 384
  • [8] Function, attachment and synthesis of lipoic acid in Escherichia coli
    Cronan, JE
    Zhao, X
    Jiang, YF
    [J]. ADVANCES IN MICROBIAL PHYSIOLOGY, VOL 50, 2005, 50 : 103 - 146
  • [9] Post-translational protein modifications in malaria parasites
    Doerig, Christian
    Rayner, Julian C.
    Scherf, Artur
    Tobin, Andrew B.
    [J]. NATURE REVIEWS MICROBIOLOGY, 2015, 13 (03) : 160 - 172
  • [10] Tyr-301 Phosphorylation Inhibits Pyruvate Dehydrogenase by Blocking Substrate Binding and Promotes the Warburg Effect
    Fan, Jun
    Kang, Hee-Bum
    Shan, Changliang
    Elf, Shannon
    Lin, Ruiting
    Xie, Jianxin
    Gu, Ting-Lei
    Aguiar, Mike
    Lonning, Scott
    Chung, Tae-Wook
    Arellano, Martha
    Khoury, Hanna J.
    Shin, Dong M.
    Khuri, Fadlo R.
    Boggon, Titus J.
    Kang, Sumin
    Chen, Jing
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2014, 289 (38) : 26533 - 26541