DNA-binding proteins are the functional proteins in cells, which play an important role in various essential biological activities. An effective and fast computational method gDNA-Prot is proposed to predict DNA-binding proteins in this paper, which is a DNA-binding predictor that combines the support vector machine classifier and a novel kind of feature called graphical representation. The DNA-binding protein sequence information was described with the 20 probabilities of amino acids and the 23 new numerical graphical representation features of a protein sequence, based on 23 physicochemical properties of 20 amino acids. The Principal Components Analysis (PCA) was employed as feature selection method for removing the irrelevant features and reducing redundant features. The Sigmod function and Min-max normalization methods for PCA were applied to accelerate the training speed and obtain higher accuracy. Experiments demonstrated that the Principal Components Analysis with Sigmod function generated the best performance. The gDNA-Prot method was also compared with the DNAbinder, iDNA-Prot and DNA Prot. The results suggested that gDNA-Prot outperformed the DNAbinder and iDNA-Prot. Although the DNA-Prot outperformed gDNA-Prot, gDNA-Prot was faster and convenient to predict the DNA-binding proteins. Additionally, the proposed gNDA-Prot method is available at http://sourceforge.netiprojects/ gdnaprot. (C) 2016 Elsevier Ltd. All rights reserved.
机构:
Hebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Zhang, Yanping
Xu, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Math Sci, Tianjin 300071, Peoples R China
Nankai Univ, LPMC, Tianjin 300071, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Xu, Jun
Zheng, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Math Sci, Tianjin 300071, Peoples R China
Nankai Univ, LPMC, Tianjin 300071, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Zheng, Wei
Zhang, Chen
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Math Sci, Tianjin 300071, Peoples R China
Nankai Univ, LPMC, Tianjin 300071, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Zhang, Chen
Qiu, Xingye
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Math Sci, Tianjin 300071, Peoples R China
Nankai Univ, LPMC, Tianjin 300071, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Qiu, Xingye
Chen, Ke
论文数: 0引用数: 0
h-index: 0
机构:
Tianjin Polytech Univ, Sch Comp Sci & Software Engn, Tianjin 300387, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
Chen, Ke
Ruan, Jishou
论文数: 0引用数: 0
h-index: 0
机构:
Nankai Univ, Coll Math Sci, Tianjin 300071, Peoples R China
Nankai Univ, LPMC, Tianjin 300071, Peoples R ChinaHebei Univ Engn, Sch Sci, Dept Math, Handan 056038, Peoples R China
机构:
Nanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R ChinaNanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China
Ma, Xin
Wu, Jiansheng
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Sch Geog & Biol Informat, Nanjing 210046, Jiangsu, Peoples R ChinaNanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China
Wu, Jiansheng
Xue, Xiaoyun
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Agr Sci, Grad Sch, Beijing 100081, Peoples R ChinaNanjing Audit Univ, Golden Audit Coll, Nanjing 210029, Jiangsu, Peoples R China