Integrating reduced amino acid composition into PSSM for improving copper ion-binding protein prediction

被引:4
作者
Liu, Shanghua [1 ,2 ]
Liang, Yuchao [1 ,3 ]
Li, Jinzhao [1 ]
Yang, Siqi [1 ]
Liu, Ming [1 ]
Liu, Chengfang [1 ]
Yang, Dezhi [2 ]
Zuo, Yongchun [1 ,2 ,3 ]
机构
[1] Inner Mongolia Univ, Inst Biomed Sci, Sch Life Sci, State Key Lab Reprod Regulat & Breeding Grassland, Hohhot 010021, Peoples R China
[2] Inner Mongolia Int Mongolian Hosp, Hohhot 010065, Peoples R China
[3] Inner Mongolia Intelligent Union Big Data Acad, Digital Coll, Hohhot 010010, Peoples R China
基金
中国国家自然科学基金;
关键词
Copper ion -binding protein; Position-specific scoring matrix; Reduced amino acid composition; WEB SERVER; SITES; SEQUENCE; IRON; SVM;
D O I
10.1016/j.ijbiomac.2023.124993
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Copper ion-binding proteins play an essential role in metabolic processes and are critical factors in many diseases, such as breast cancer, lung cancer, and Menkes disease. Many algorithms have been developed for predicting metal ion classification and binding sites, but none have been applied to copper ion-binding proteins. In this study, we developed a copper ion-bound protein classifier, RPCIBP, which integrating the reduced amino acid composition into position-specific scoring matrix (PSSM). The reduced amino acid composition filters out a large number of useless evolutionary features, improving the operational efficiency and predictive ability of the model (feature dimension from 2900 to 200, ACC from 83 % to 85.1 %). Compared with the basic model using only three sequence feature extraction methods (ACC in training set between 73.8 %-86.2 %, ACC in test set between 69.3 %-87.5 %), the model integrating the evolutionary features of the reduced amino acid composition showed higher accuracy and robustness (ACC in training set between 83.1 %-90.8 %, ACC in test set between 79.1 %-91.9 %). Best copper ion-binding protein classifiers filtered by feature selection progress were deployed in a user-friendly web server (http://bioinfor.imu.edu.cn/RPCIBP). RPCIBP can accurately predict copper ionbinding proteins, which is convenient for further structural and functional studies, and conducive to mechanism exploration and target drug development.
引用
收藏
页数:9
相关论文
共 28 条
  • [11] Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition
    Su, Wenxia
    Deng, Shuyi
    Gu, Zhifeng
    Yang, Keli
    Ding, Hui
    Chen, Hui
    Zhang, Zhaoyue
    FRONTIERS IN GENETICS, 2023, 14
  • [12] iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition
    Feng, Peng-Mian
    Chen, Wei
    Lin, Hao
    Chou, Kuo-Chen
    ANALYTICAL BIOCHEMISTRY, 2013, 442 (01) : 118 - 125
  • [13] cnnAlpha: Protein disordered regions prediction by reduced amino acid alphabets and convolutional neural networks
    Oberti, Mauricio
    Vaisman, Iosif I.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2020, 88 (11) : 1472 - 1481
  • [14] Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou’s General Pseudo Amino Acid Composition
    Khurshid Ahmad
    Muhammad Waris
    Maqsood Hayat
    The Journal of Membrane Biology, 2016, 249 : 293 - 304
  • [15] DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation
    Liu, Bin
    Wang, Shanyi
    Wang, Xiaolong
    SCIENTIFIC REPORTS, 2015, 5
  • [16] Prediction of Subcellular Localization of Apoptosis Protein Using Chou's Pseudo Amino Acid Composition
    Lin, Hao
    Wang, Hao
    Ding, Hui
    Chen, Ying-Li
    Li, Qian-Zhong
    ACTA BIOTHEORETICA, 2009, 57 (03) : 321 - 330
  • [17] Prediction of Protein Phosphorylation Sites by Using the Composition of k-Spaced Amino Acid Pairs
    Zhao, Xiaowei
    Zhang, Wenyi
    Xu, Xin
    Ma, Zhiqiang
    Yin, Minghao
    PLOS ONE, 2012, 7 (10):
  • [18] Chou's pseudo amino acid composition improves sequence-based antifreeze protein prediction
    Mondal, Sukanta
    Pai, Priyadarshini P.
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 356 : 30 - 35
  • [19] Prediction of protein-peptide-binding amino acid residues regions using machine learning algorithms
    Shafiee, Shima
    Fathi, Abdolhossein
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [20] Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition
    Chen, Ying-Li
    Li, Qian-Zhong
    JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) : 377 - 381