SignalP 6.0 predicts all five types of signal peptides using protein language models

被引:1486
作者
Teufel, Felix [1 ,2 ]
Almagro Armenteros, Jose Juan [3 ]
Johansen, Alexander Rosenberg [4 ]
Gislason, Magnus Halldor [5 ]
Pihl, Silas Irby [1 ]
Tsirigos, Konstantinos D. [6 ]
Winther, Ole [7 ,8 ]
Brunak, Soren [3 ]
von Heijne, Gunnar [9 ,10 ]
Nielsen, Henrik [1 ]
机构
[1] Tech Univ Denmark, Sect Bioinformat, Dept Hlth Technol, Lyngby, Denmark
[2] Swiss Fed Inst Technol, Dept Biosyst Sci & Engn, Basel, Switzerland
[3] Univ Copenhagen, Fac Hlth & Med Sci, Novo Nordisk Fdn Ctr Prot Res, Copenhagen, Denmark
[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[5] Rigshosp Copenhagen Univ Hosp, Ctr Genom Med, Copenhagen, Denmark
[6] EMBL EBI, Wellcome Genome Campus, Cambridge, England
[7] Univ Copenhagen, Dept Biol, Bioinformat Ctr, Copenhagen, Denmark
[8] Tech Univ Denmark, Sect Cognit Syst, Dept Appl Math & Comp Sci, Lyngby, Denmark
[9] Stockholm Univ, Dept Biochem & Biophys, Stockholm, Sweden
[10] Stockholm Univ, Sci Life Lab, Solna, Sweden
基金
瑞典研究理事会;
关键词
SEQUENCE;
D O I
10.1038/s41587-021-01156-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. SPs can be predicted from sequence data, but existing algorithms are unable to detect all known types of SPs. We introduce SignalP 6.0, a machine learning model that detects all five SP types and is applicable to metagenomic data. A new version of SignalP predicts all types of signal peptides.
引用
收藏
页码:1023 / +
页数:6
相关论文
共 34 条
[1]   Unified rational protein engineering with sequence-based deep representation learning [J].
Alley, Ethan C. ;
Khimulya, Grigory ;
Biswas, Surojit ;
AlQuraishi, Mohammed ;
Church, George M. .
NATURE METHODS, 2019, 16 (12) :1315-+
[2]   SignalP 5.0 improves signal peptide predictions using deep neural networks [J].
Armenteros, Jose Juan Almagro ;
Tsirigos, Konstantinos D. ;
Sonderby, Casper Kaae ;
Petersen, Thomas Nordahl ;
Winther, Ole ;
Brunak, Soren ;
von Heijne, Gunnar ;
Nielsen, Henrik .
NATURE BIOTECHNOLOGY, 2019, 37 (04) :420-+
[3]   UniProt: a worldwide hub of protein knowledge [J].
Bateman, Alex ;
Martin, Maria-Jesus ;
Orchard, Sandra ;
Magrane, Michele ;
Alpi, Emanuele ;
Bely, Benoit ;
Bingley, Mark ;
Britto, Ramona ;
Bursteinas, Borisas ;
Busiello, Gianluca ;
Bye-A-Jee, Hema ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Castro, Leyla Garcia ;
Garmiri, Penelope ;
Georghiou, George ;
Gonzales, Daniel ;
Gonzales, Leonardo ;
Hatton-Ellis, Emma ;
Ignatchenko, Alexandr ;
Ishtiaq, Rizwan ;
Jokinen, Petteri ;
Joshi, Vishal ;
Jyothi, Dushyanth ;
Lopez, Rodrigo ;
Luo, Jie ;
Lussi, Yvonne ;
MacDougall, Alistair ;
Madeira, Fabio ;
Mahmoudy, Mahdi ;
Menchi, Manuela ;
Nightingale, Andrew ;
Onwubiko, Joseph ;
Palka, Barbara ;
Pichler, Klemens ;
Pundir, Sangya ;
Qi, Guoying ;
Raj, Shriya ;
Renaux, Alexandre ;
Lopez, Milagros Rodriguez ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Speretta, Elena ;
Turner, Edward ;
Tyagi, Nidhi ;
Vasudev, Preethi ;
Volynkin, Vladimir ;
Wardell, Tony .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D506-D515
[4]  
Bhandari V., 2014, The Prokaryotes: Other Major Lineages of Bacteria and The Archaea, P989, DOI [10.1007/0-387-30743-5, DOI 10.1007/0-387-30743-5]
[5]   Low-N protein engineering with data-efficient deep learning [J].
Biswas, Surojit ;
Khimulya, Grigory ;
Alley, Ethan C. ;
Esvelt, Kevin M. ;
Church, George M. .
NATURE METHODS, 2021, 18 (04) :389-+
[6]   Type IV pili: dynamics, biophysics and functional consequences [J].
Craig, Lisa ;
Forest, Katrina T. ;
Maier, Berenike .
NATURE REVIEWS MICROBIOLOGY, 2019, 17 (07) :429-440
[7]   Membrane Proteases in the Bacterial Protein Secretion and Quality Control Pathway [J].
Dalbey, Ross E. ;
Wang, Peng ;
van Dijl, Jan Maarten .
MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 2012, 76 (02) :311-330
[8]   Learned Embeddings from Deep Learning to Visualize and Predict Protein Sets [J].
Dallago, Christian ;
Schuetze, Konstantin ;
Heinzinger, Michael ;
Olenyi, Tobias ;
Littmann, Maria ;
Lu, Amy X. ;
Yang, Kevin K. ;
Min, Seonwoo ;
Yoon, Sungroh ;
Morton, James T. ;
Rost, Burkhard .
CURRENT PROTOCOLS, 2021, 1 (05)
[9]   ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins [J].
de Castro, Edouard ;
Sigrist, Christian J. A. ;
Gattiker, Alexandre ;
Bulliard, Virginie ;
Langendijk-Genevaux, Petra S. ;
Gasteiger, Elisabeth ;
Bairoch, Amos ;
Hulo, Nicolas .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W362-W365
[10]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171