DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks

被引:65
作者
Savojardo, Castrense [1 ]
Bruciaferri, Niccolo [1 ]
Tartari, Giacomo [1 ,2 ]
Martelli, Pier Luigi [1 ]
Casadio, Rita [1 ,2 ]
机构
[1] Univ Bologna, Dept Pharm & Biotechnol FaBiT, Biocomp Grp, Bologna, Italy
[2] Italian Natl Res Council CNR, Inst Biomembranes Bioenerget & Mol Biotechnol IBI, Bari, Italy
关键词
AMINO-ACID-COMPOSITION; CLEAVAGE;
D O I
10.1093/bioinformatics/btz512
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation The correct localization of proteins in cell compartments is a key issue for their function. Particularly, mitochondrial proteins are physiologically active in different compartments and their aberrant localization contributes to the pathogenesis of human mitochondrial pathologies. Many computational methods exist to assign protein sequences to subcellular compartments such as nucleus, cytoplasm and organelles. However, a substantial lack of experimental evidence in public sequence databases hampered so far a finer grain discrimination, including also intra-organelle compartments. Results We describe DeepMito, a novel method for predicting protein sub-mitochondrial cellular localization. Taking advantage of powerful deep-learning approaches, such as convolutional neural networks, our method is able to achieve very high prediction performances when discriminating among four different mitochondrial compartments (matrix, outer, inner and intermembrane regions). The method is trained and tested in cross-validation on a newly generated, high-quality dataset comprising 424 mitochondrial proteins with experimental evidence for sub-organelle localizations. We benchmark DeepMito towards the only one recent approach developed for the same task. Results indicate that DeepMito performances are superior. Finally, genomic-scale prediction on a highly-curated dataset of human mitochondrial proteins further confirms the effectiveness of our approach and suggests that DeepMito is a good candidate for genome-scale annotation of mitochondrial protein subcellular localization. Availability and implementation The DeepMito web server as well as all datasets used in this study are available at http://busca.biocomp.unibo.it/deepmito. A standalone version of DeepMito is available on DockerHub at https://hub.docker.com/r/bolognabiocomp/deepmito. DeepMito source code is available on GitHub at https://github.com/BolognaBiocomp/deepmito Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:56 / 64
页数:9
相关论文
共 33 条
[1]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Deep learning for computational biology [J].
Angermueller, Christof ;
Parnamaa, Tanel ;
Parts, Leopold ;
Stegle, Oliver .
MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
[4]   DeepLoc: prediction of protein subcellular localization using deep learning [J].
Armenteros, Jose Juan Almagro ;
Sonderby, Casper Kaae ;
Sonderby, Soren Kaae ;
Nielsen, Henrik ;
Winther, Ole .
BIOINFORMATICS, 2017, 33 (21) :3387-3395
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   Extensive feature detection of N-terminal protein sorting signals [J].
Bannai, H ;
Tamada, Y ;
Maruyama, O ;
Nakai, K ;
Miyano, S .
BIOINFORMATICS, 2002, 18 (02) :298-305
[7]   Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence [J].
Du, Pufeng ;
Li, Yanda .
BMC BIOINFORMATICS, 2006, 7 (1)
[8]   SubMito-PSPCP: Predicting Protein Submitochondrial Locations by Hybridizing Positional Specific Physicochemical Properties with Pseudoamino Acid Compositions [J].
Du, Pufeng ;
Yu, Yuan .
BIOMED RESEARCH INTERNATIONAL, 2013, 2013
[9]   Mitochondrial protein import: Common principles and physiological networks [J].
Dudek, Jan ;
Rehling, Peter ;
van der Laan, Martin .
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH, 2013, 1833 (02) :274-285
[10]   Locating proteins in the cell using TargetP, SignalP and related tools [J].
Emanuelsson, Olof ;
Brunak, Soren ;
von Heijne, Gunnar ;
Nielsen, Henrik .
NATURE PROTOCOLS, 2007, 2 (04) :953-971