A Protein-DNA Binding Site Prediction Method Based on Multi-View Feature Fusion of Adjacent Residue

被引:0
作者
Yang, Ji [1 ]
Zhang, Shuning [1 ]
机构
[1] Anhui Univ Chinese Med, Affiliated Hosp 1, Hefei 230031, Peoples R China
关键词
Protein-DNA binding site prediction; neighboring residue correlations; feature processing; multi-view features combining; support vector machine; SEQUENCE; NETWORKS; SHAPE;
D O I
10.1109/ACCESS.2023.3297207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The interaction between proteins and DNA occurs widely during the replication and transcription of DNA and other life activities. Therefore, the identification of protein- and DNA-binding sites is important for the study of protein function and drug design. Accurate prediction of binding sites has become a challenging and significant task. Although numerous studies have been conducted, prediction is challenging. In this study, a new protein-DNA binding site prediction method was proposed. This method is based on neighboring residue correlations. It uses an improved feature representation method that weighted combines several protein characteristics after a series of processing of the features and chooses a support vector machine as the prediction engine. Experiments on benchmark datasets and independent test datasets show that the proposed method has better predictability than other protein-DNA binding site predictors. This method is complementary to the existing protein-DNA binding site predictors and will be useful in the field of biotechnology.
引用
收藏
页码:79609 / 79623
页数:15
相关论文
共 54 条
  • [1] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [2] ASAView: Database and tool for solvent accessibility representation in proteins
    Ahmad, S
    Gromiha, M
    Fawareh, H
    Sarai, A
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [3] PSSM-based prediction of DNA binding sites in proteins
    Ahmad, S
    Sarai, A
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [4] DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information
    Ali, Farman
    Ahmed, Saeed
    Swati, Zar Nawab Khan
    Akbar, Shahid
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2019, 33 (07) : 645 - 658
  • [5] [Anonymous], 1971, NATURE-NEW BIOL, V233, P223, DOI [DOI 10.1038/NEWBIO233223B0, 10.1038/newbio233223b0]
  • [6] Asghari Mehdi Poursheikhali, 2019, Avicenna Journal of Medical Biotechnology, V11, P104
  • [7] Balakrishnama S., 1998, Linear discriminant analysis‐a brief tutorial, V18, P1
  • [8] Beamer S, 2013, SCI PROGRAMMING-NETH, V21, P137, DOI [10.3233/SPR-130370, 10.1155/2013/702694]
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Broomhead D. S., 1988, Complex Systems, V2, P321