Integrating reduced amino acid composition into PSSM for improving copper ion-binding protein prediction

Cited by: 4
Authors
Liu, Shanghua [1, 2]
Liang, Yuchao [1, 3]
Li, Jinzhao [1]
Yang, Siqi [1]
Liu, Ming [1]
Liu, Chengfang [1]
Yang, Dezhi [2]
Zuo, Yongchun [1, 2, 3]
Affiliations
[1] Inner Mongolia Univ, Inst Biomed Sci, Sch Life Sci, State Key Lab Reprod Regulat & Breeding Grassland, Hohhot 010021, Peoples R China
[2] Inner Mongolia Int Mongolian Hosp, Hohhot 010065, Peoples R China
[3] Inner Mongolia Intelligent Union Big Data Acad, Digital Coll, Hohhot 010010, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Copper ion-binding protein; Position-specific scoring matrix; Reduced amino acid composition; WEB SERVER; SITES; SEQUENCE; IRON; SVM;
DOI
10.1016/j.ijbiomac.2023.124993
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology];
Discipline classification codes
071010; 081704;
Abstract
Copper ion-binding proteins play an essential role in metabolic processes and are critical factors in many diseases, such as breast cancer, lung cancer, and Menkes disease. Many algorithms have been developed for predicting metal ion classification and binding sites, but none have been applied to copper ion-binding proteins. In this study, we developed a copper ion-binding protein classifier, RPCIBP, which integrates the reduced amino acid composition into the position-specific scoring matrix (PSSM). The reduced amino acid composition filters out a large number of uninformative evolutionary features, improving the operational efficiency and predictive ability of the model (feature dimension reduced from 2900 to 200, accuracy (ACC) increased from 83% to 85.1%). Compared with the basic models using only three sequence feature extraction methods (ACC of 73.8%-86.2% on the training set and 69.3%-87.5% on the test set), the models integrating the evolutionary features of the reduced amino acid composition showed higher accuracy and robustness (ACC of 83.1%-90.8% on the training set and 79.1%-91.9% on the test set). The best copper ion-binding protein classifiers, selected through the feature selection process, were deployed in a user-friendly web server (http://bioinfor.imu.edu.cn/RPCIBP). RPCIBP can accurately predict copper ion-binding proteins, which is convenient for further structural and functional studies and conducive to mechanism exploration and target drug development.
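The idea of folding a reduced amino acid alphabet into a PSSM can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the five-cluster alphabet `CLUSTERS`, the column order `AMINO_ACIDS`, and the function names are hypothetical and are not taken from RPCIBP, and the sketch does not reproduce the 2900 and 200 feature dimensions reported in the abstract. It only shows how merging the 20 PSSM columns into k clusters shrinks a 20 x 20 composition feature to k x k.

```python
# Hypothetical 5-cluster reduced alphabet (illustrative only; not the
# clustering scheme used by RPCIBP).
CLUSTERS = ["AVLIMC", "FWYH", "STNQ", "KR", "DEGP"]

# Assumed PSSM column order (the order commonly used in PSI-BLAST ASCII output).
AMINO_ACIDS = "ARNDCQEGHILKMFPSTWYV"


def reduce_pssm(pssm, clusters=CLUSTERS):
    """Collapse an L x 20 PSSM to L x k by summing the columns in each cluster."""
    col = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    return [[sum(row[col[aa]] for aa in cluster) for cluster in clusters]
            for row in pssm]


def reduced_pssm_composition(sequence, pssm, clusters=CLUSTERS):
    """Return a flat k*k feature vector.

    Rows of the reduced PSSM are accumulated by the cluster of the residue
    at each position, then divided by the sequence length.
    """
    aa_to_cluster = {aa: i for i, cluster in enumerate(clusters) for aa in cluster}
    k = len(clusters)
    reduced = reduce_pssm(pssm, clusters)
    sums = [[0.0] * k for _ in range(k)]
    for aa, row in zip(sequence, reduced):
        group = aa_to_cluster.get(aa)
        if group is None:
            continue  # skip non-standard residues such as 'X'
        for j, value in enumerate(row):
            sums[group][j] += value
    n = max(len(sequence), 1)  # avoid division by zero for empty input
    return [value / n for row in sums for value in row]


# Toy input: a two-residue sequence with a made-up PSSM.
toy_sequence = "AK"
toy_pssm = [[1] * 20, [0] * 20]
features = reduced_pssm_composition(toy_sequence, toy_pssm)
print(len(features))
```

With five clusters the feature vector has 25 entries instead of the 400 produced by the same composition scheme on the full 20-letter alphabet, which is the kind of dimensionality reduction the abstract describes.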
Pages: 9
Related papers
28 in total
  • [1] A protein fold classifier formed by fusing different modes of pseudo amino acid composition via PSSM
    Kavousi, Kaveh
    Moshiri, Behzad
    Sadeghi, Mehdi
    Araabi, Babak N.
    Moosavi-Movahedi, Ali Akbar
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2011, 35 (01) : 1 - 9
  • [2] Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile
    Verma, Ruchi
    Varshney, Grish C.
    Raghava, G. P. S.
    AMINO ACIDS, 2010, 39 (01) : 101 - 110
  • [3] Prediction of mitochondrial proteins of malaria parasite using split amino acid composition and PSSM profile
    Ruchi Verma
    Grish C. Varshney
    G. P. S. Raghava
    Amino Acids, 2010, 39 : 101 - 110
  • [4] Protein location prediction using atomic composition and global features of the amino acid sequence
    Cherian, Betsy Sheena
    Nair, Achuthsankar S.
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2010, 391 (04) : 1670 - 1674
  • [5] Research progress of reduced amino acid alphabets in protein analysis and prediction
    Liang, Yuchao
    Yang, Siqi
    Zheng, Lei
    Wang, Hao
    Zhou, Jian
    Huang, Shenghui
    Yang, Lei
    Zuo, Yongchun
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 3503 - 3510
  • [6] Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition
    Ahmad, Khurshid
    Waris, Muhammad
    Hayat, Maqsood
    JOURNAL OF MEMBRANE BIOLOGY, 2016, 249 (03) : 293 - 304
  • [7] Prediction of Thermophilic Protein with Pseudo Amino Acid Composition: An Approach from Combined Feature Selection and Reduction
    Wang, De
    Yang, Liang
    Fu, Zhengqi
    Xia, Jingbo
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (07) : 684 - 689
  • [8] Improvement of protein binding sites prediction by selecting amino acid residues' features
    Mirceva, Georgina
    Kulakov, Andrea
    JOURNAL OF STRUCTURAL BIOLOGY, 2015, 189 (01) : 9 - 19
  • [9] Improving Protein Localization Prediction Using Amino Acid Group Based Physichemical Encoding
    Hu, Jianjun
    Zhang, Fan
    BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 248 - 258
  • [10] Prediction of membrane protein types by using dipeptide and pseudo amino acid composition-based composite features
    Hayat, M.
    Khan, A.
    IET COMMUNICATIONS, 2012, 6 (18) : 3257 - 3264