Identification of DNA-binding protein based multiple kernel model

被引:2
作者
Qian, Yuqing [1 ]
Shang, Tingting [1 ]
Guo, Fei [2 ]
Wang, Chunliang [3 ]
Cui, Zhiming [1 ]
Ding, Yijie [4 ]
Wu, Hongjie [1 ]
机构
[1] Suzhou Univ Sci & Technol, Coll Elect & Informat Engn, Suzhou, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[3] Soochow Univ, Affiliated Hosp 2, Suzhou, Peoples R China
[4] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA-binding proteins; multiple kernel learning; local kernel alignment; restricted kernel machine; SUPPORT VECTOR MACHINES; AMINO-ACID-COMPOSITION; WEB SERVER; PREDICTION; INFORMATION; INTEGRATION; DEEP; PSEAAC; DPP; RNA;
D O I
10.3934/mbe.2023586
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA-binding proteins (DBPs) play a critical role in the development of drugs for treating genetic diseases and in DNA biology research. It is essential for predicting DNA-binding proteins more accurately and efficiently. In this paper, a Laplacian Local Kernel Alignment-based Restricted Kernel Machine (LapLKA-RKM) is proposed to predict DBPs. In detail, we first extract features from the protein sequence using six methods. Second, the Radial Basis Function (RBF) kernel function is utilized to construct pre-defined kernel metrics. Then, these metrics are combined linearly by weights calculated by LapLKA. Finally, the fused kernel is input to RKM for training and prediction. Independent tests and leave-one-out cross-validation were used to validate the performance of our method on a small dataset and two large datasets. Importantly, we built an online platform to represent our model, which is now freely accessible via http://8.130.69.121:8082/.
引用
收藏
页码:13149 / 13170
页数:22
相关论文
共 87 条
  • [1] Reptile Search Algorithm (RSA): A nature-inspired meta-heuristic optimizer
    Abualigah, Laith
    Abd Elaziz, Mohamed
    Sumari, Putra
    Geem, Zong Woo
    Gandomi, Amir H.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [2] Applications, Deployments, and Integration of Internet of Drones (IoD): A Review
    Abualigah, Laith
    Diabat, Ali
    Sumari, Putra
    Gandomi, Amir H.
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (22) : 25532 - 25546
  • [3] Aquila Optimizer: A novel meta-heuristic optimization algorithm
    Abualigah, Laith
    Yousri, Dalia
    Abd Elaziz, Mohamed
    Ewees, Ahmed A.
    Al-qaness, Mohammed A. A.
    Gandomi, Amir H.
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 157 (157)
  • [4] The Arithmetic Optimization Algorithm
    Abualigah, Laith
    Diabat, Ali
    Mirjalili, Seyedali
    Elaziz, Mohamed Abd
    Gandomi, Amir H.
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 376
  • [5] Dwarf Mongoose Optimization Algorithm
    Agushaka, Jeffrey O.
    Ezugwu, Absalom E.
    Abualigah, Laith
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2022, 391
  • [6] Identification of DNA N4-methylcytosine Sites via Multiview Kernel Sparse Representation Model
    Ai C.
    Tiwari P.
    Yang H.
    Ding Y.
    Tang J.
    Guo F.
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1236 - 1245
  • [7] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [8] DBP-CNN: Deep learning-based prediction of DNA-binding proteins by coupling discrete cosine transform with two-dimensional convolutional neural network
    Barukab, Omar
    Ali, Farman
    Alghamdi, Wajdi
    Bassam, Yoosef
    Khan, Sher Afzal
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments
    Buck, MJ
    Lieb, JD
    [J]. GENOMICS, 2004, 83 (03) : 349 - 360