Identification of DNA-binding proteins using multi-features fusion and binary firefly optimization algorithm

Cited by: 30
Authors
Zhang, Jian [1]
Gao, Bo [1]
Chai, Haiting [1]
Ma, Zhiqiang [1]
Yang, Guifu [1,2]
Affiliations
[1] Northeast Normal Univ, Sch Comp Sci & Informat Technol, Changchun 130117, Peoples R China
[2] Northeast Normal Univ, Off Informatizat Management & Planning, Changchun 130117, Peoples R China
Source
BMC BIOINFORMATICS | 2016 / Vol. 17
Keywords
DNA-binding proteins; Binary firefly algorithm; Feature selection; Parameters optimization; FEATURE-SELECTION; FREE-ENERGY; PREDICTION; SEQUENCE; RECOGNITION; SPECIFICITY; DESIGN;
DOI
10.1186/s12859-016-1201-8
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Background: DNA-binding proteins (DBPs) play fundamental roles in many biological processes. Therefore, the development of effective computational tools for identifying DBPs is highly desirable. Results: In this study, we proposed an accurate method for the prediction of DBPs. Firstly, we focused on the challenge of improving DBP prediction accuracy using information solely from the sequence. Secondly, we used multiple informative features to encode the protein. These features included the evolutionary conservation profile, secondary structure motifs, and physicochemical properties. Thirdly, we introduced a novel improved Binary Firefly Algorithm (BFA) to remove redundant or noisy features and to select optimal parameters for the classifier. On two benchmark datasets, our predictor outperformed many state-of-the-art predictors, which demonstrated the effectiveness of our method. The promising prediction performance on a newly compiled independent testing dataset from PDB and a large-scale dataset from UniProt demonstrated the good generalization ability of our method. In addition, the BFA developed in this research has great potential for practical applications in optimization, especially in feature selection problems. Conclusions: A highly accurate method was proposed for the identification of DBPs. A user-friendly web server named iDbP (identification of DNA-binding Proteins) was constructed and provided for academic use.
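The feature-selection idea mentioned in the abstract can be illustrated with a minimal sketch of a generic binary firefly search over feature masks. This is not the authors' improved BFA, whose details are not given in this record: the function name `binary_firefly_select`, the Hamming-distance attraction rule, the random bit-flip step, all parameter defaults, and the toy fitness function are assumptions made for illustration. In a real setting the fitness would presumably be a classifier score (for example, cross-validated accuracy) on the selected features, and the classifier-parameter optimization described in the abstract is omitted here.

```python
import math
import random


def binary_firefly_select(fitness, n_features, n_fireflies=10, n_iter=30,
                          beta0=1.0, gamma=1.0, alpha=0.05, seed=0):
    """Generic binary firefly search (illustrative, not the paper's variant).

    Each firefly is a 0/1 mask over features; brighter means higher fitness.
    Returns the best mask seen and its fitness.
    """
    rng = random.Random(seed)
    swarm = [[rng.randint(0, 1) for _ in range(n_features)]
             for _ in range(n_fireflies)]
    light = [fitness(mask) for mask in swarm]
    best_idx = max(range(n_fireflies), key=light.__getitem__)
    best_mask, best_score = swarm[best_idx][:], light[best_idx]

    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if light[j] <= light[i]:
                    continue  # only move toward strictly brighter fireflies
                # Normalized Hamming distance between the two masks.
                r = sum(a != b for a, b in zip(swarm[i], swarm[j])) / n_features
                beta = beta0 * math.exp(-gamma * r * r)  # attractiveness
                for k in range(n_features):
                    # Copy a differing bit from the brighter firefly
                    # with probability beta.
                    if swarm[i][k] != swarm[j][k] and rng.random() < beta:
                        swarm[i][k] = swarm[j][k]
                    # Random exploration: flip the bit with probability alpha.
                    if rng.random() < alpha:
                        swarm[i][k] ^= 1
                light[i] = fitness(swarm[i])
                if light[i] > best_score:
                    best_mask, best_score = swarm[i][:], light[i]
    return best_mask, best_score


# Hypothetical toy fitness: features 0, 2 and 5 are "informative";
# every other selected feature costs a small penalty.
INFORMATIVE = {0, 2, 5}


def toy_fitness(mask):
    gain = sum(bit for k, bit in enumerate(mask) if k in INFORMATIVE)
    noise = sum(bit for k, bit in enumerate(mask) if k not in INFORMATIVE)
    return gain - 0.5 * noise


if __name__ == "__main__":
    mask, score = binary_firefly_select(toy_fitness, n_features=8, seed=1)
    print(mask, score)
```

Tracking the best mask separately from the swarm means the returned score never gets worse as the random flips perturb individual fireflies, which is a design choice of this sketch rather than something stated in the abstract.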
Pages: 12