Predicting subcellular location of protein with evolution information and sequence-based deep learning

被引:0
|
作者
Liao, Zhijun [1 ,2 ]
Pan, Gaofeng [2 ]
Sun, Chao [2 ]
Tang, Jijun [2 ,3 ]
机构
[1] Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, 1 Xuefu North Road, University Town, Fuzhou,FJ,350122, China
[2] Department of Computer Science and Engineering, University of South Carolina, 550 Assembly St, Columbia,SC,29208, United States
[3] College of Electrical and Power Engineering, Taiyuan University of Technology, No. 79 Yinze West Street, Taiyuan,SX,030024, China
基金
中国国家自然科学基金;
关键词
Deep learning - Biology - Location - Convolutional neural networks - Forecasting - Classification (of information);
D O I
暂无
中图分类号
学科分类号
摘要
Background: Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations. Results: Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848. Conclusion: The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location. © 2021, The Author(s).
引用
收藏
相关论文
共 50 条
  • [1] Predicting subcellular location of protein with evolution information and sequence-based deep learning
    Liao, Zhijun
    Pan, Gaofeng
    Sun, Chao
    Tang, Jijun
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 10)
  • [2] Predicting subcellular location of protein with evolution information and sequence-based deep learning
    Zhijun Liao
    Gaofeng Pan
    Chao Sun
    Jijun Tang
    BMC Bioinformatics, 22
  • [3] Predicting protein-protein interactions through sequence-based deep learning
    Hashemifar, Somaye
    Neyshabur, Behnam
    Khan, Aly A.
    Xu, Jinbo
    BIOINFORMATICS, 2018, 34 (17) : 802 - 810
  • [4] Predicting Protein Subcellular Location Based on a Novel Sequence Numerical Model
    Chen, Haowen
    Chen, Xia
    Hu, Qingming
    Cao, Zhi
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (01) : 82 - 87
  • [5] N-terminal sequence-based prediction of subcellular location
    Evangelia I Petsalakis
    Pantelis G Bagos
    Zoi I Litou
    Stavros J Hamodrakas
    BMC Bioinformatics, 6 (Suppl 3)
  • [6] N-terminal sequence-based prediction of subcellular location
    不详
    BMC BIOINFORMATICS, 2005, 6
  • [7] DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Mall, Raghvendra
    Bensmail, Halima
    BIOINFORMATICS, 2019, 35 (13) : 2216 - 2225
  • [8] DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Bensmail, Halima
    Mall, Raghvendra
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2747 - 2749
  • [9] DeepSol: a deep learning framework for sequence-based protein solubility prediction
    Khurana, Sameer
    Rawi, Reda
    Kunji, Khalid
    Chuang, Gwo-Yu
    Bensmail, Halima
    Mall, Raghvendra
    BIOINFORMATICS, 2018, 34 (15) : 2605 - 2613
  • [10] Unified rational protein engineering with sequence-based deep representation learning
    Alley, Ethan C.
    Khimulya, Grigory
    Biswas, Surojit
    AlQuraishi, Mohammed
    Church, George M.
    NATURE METHODS, 2019, 16 (12) : 1315 - +