Support vector machine with a Pearson VII function kernel for discriminating halophilic and non-halophilic proteins

被引:32
|
作者
Zhang, Guangya [1 ]
Ge, Huihua [1 ]
机构
[1] Huaqiao Univ, Dept Biotechnol & Bioengn, Xiamen 361021, Fujian, Peoples R China
关键词
Halophile; Pearson VII function kernel; Support vector machine; Amino acid composition; Hypersaline adaptation; AMINO-ACID-COMPOSITION; PREDICTING SUBCELLULAR-LOCALIZATION; GENOME SEQUENCE; STABILITY; ADAPTATION; CLASSIFIER; FRAGMENT; BACTERIA; MUTATION; DOMAINS;
D O I
10.1016/j.compbiolchem.2013.05.001
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Understanding of proteins adaptive to hypersaline environment and identifying them is a challenging task and would help to design stable proteins. Here, we have systematically analyzed the normalized amino acid compositions of 2121 halophilic and 2400 non-halophilic proteins. The results showed that halophilic protein contained more Asp at the expense of Lys, Ile, Cys and Met, fewer small and hydrophobic residues, and showed a large excess of acidic over basic amino acids. Then, we introduce a support vector machine method to discriminate the halophilic and non-halophilic proteins, by using a novel Pearson VII universal function based kernel. In the three validation check methods, it achieved an overall accuracy of 97.7%, 91.7% and 86.9% and outperformed other machine learning algorithms. We also address the influence of protein size on prediction accuracy and found the worse performance for small size proteins might be some significant residues (Cys and Lys) were missing in the proteins. (c) 2013 The Authors. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 22
页数:7
相关论文
共 50 条
  • [1] Designing predictors of halophilic and non-halophilic proteins using support vector machines
    Huang, Hui-Ling
    Charoenkwan, Phasit
    Srinivasulu, Yerukala Sathipati
    Lee, Hua-Chin
    Ho, Shinn-Ying
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2013, : 230 - 237
  • [2] Facilitating the application of Support Vector Regression by using a universal Pearson VII function based kernel
    Üstün, B
    Melssen, WJ
    Buydens, LMC
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 81 (01) : 29 - 40
  • [3] Research on kernel function of support vector machine
    Liu, Lijuan
    Shen, Bo
    Wang, Xing
    Journal of Computers (Taiwan), 2014, 25 (01) : 12 - 19
  • [4] Effect of polynomial, radial basis, and Pearson VII function kernels in support vector machine algorithm for classification of crayfish
    Garabaghi, Farid Hassanbaki
    Benzer, Recep
    Benzer, Semra
    Gunal, Aysel Caglan
    ECOLOGICAL INFORMATICS, 2022, 72
  • [5] Support vector machine based on person VII kernel function and its application in chemical pattern classification
    Zheng Qi-Fu
    Chen De-Zhao
    Liu Hua-Zhang
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2007, 35 (08) : 1142 - 1146
  • [6] The Study on Gauss Kernel Function in Support Vector Machine
    Wan Fuyong
    Zhao Ying
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 779 - 784
  • [7] Linear support vector machine based on kernel function
    Gao, Shang
    Hu, Xuekun
    Zhang, Zaiyue
    Cao, Cungen
    Journal of Computational Information Systems, 2009, 5 (04): : 1089 - 1095
  • [8] Dynamic Support Vector Machine by Distributing Kernel Function
    Shi Guangzhi
    Da Lianglong
    Hu Junchuan
    Zhou Yanxia
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 2, 2010, : 362 - 365
  • [9] A new trigonometric kernel function for support vector machine
    Sajad Fathi Hafshejani
    Zahra Moaberfard
    Iran Journal of Computer Science, 2023, 6 (2) : 137 - 145
  • [10] Critical support vector machine without kernel function
    Raicharoen, T
    Lursinap, C
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2532 - 2536