ADOPT: intrinsic protein disorder prediction through deep bidirectional transformers

被引:12
|
作者
Redl, Istvan [1 ]
Fisicaro, Carlo [1 ]
Dutton, Oliver [1 ]
Hoffmann, Falk [1 ]
Henderson, Louie [1 ]
Owens, Benjamin M. J. [1 ]
Heberling, Matthew [1 ]
Paci, Emanuele [1 ,2 ]
Tamiola, Kamil [1 ]
机构
[1] Peptone Ltd, 370 Grays Inn Rd, London WC1X 8BB, England
[2] Univ Bologna, Dept Phys & Astron Augusto Righi, I-40127 Bologna, Italy
关键词
UNSTRUCTURED REGIONS; ACCURATE PREDICTION; WEB SERVER; SEQUENCES;
D O I
10.1093/nargab/lqad041
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Intrinsically disordered proteins (IDPs) are important for a broad range of biological functions and are involved in many diseases. An understanding of intrinsic disorder is key to develop compounds that target IDPs. Experimental characterization of IDPs is hindered by the very fact that they are highly dynamic. Computational methods that predict disorder from the amino acid sequence have been proposed. Here, we present ADOPT (Attention DisOrder PredicTor), a new predictor of protein disorder. ADOPT is composed of a self-supervised encoder and a supervised disorder predictor. The former is based on a deep bidirectional transformer, which extracts dense residue-level representations from Facebook's Evolutionary Scale Modeling library. The latter uses a database of nuclear magnetic resonance chemical shifts, constructed to ensure balanced amounts of disordered and ordered residues, as a training and a test dataset for protein disorder. ADOPT predicts whether a protein or a specific region is disordered with better performance than the best existing predictors and faster than most other proposed methods (a few seconds per sequence). We identify the features that are relevant for the prediction performance and show that good performance can already be gained with <100 features. ADOPT is available as a stand-alone package at https://github.com/PeptoneLtd/ADOPT and as a web server at https://adopt.peptone.io/.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning
    Jack Hanson
    Kuldip K.Paliwal
    Thomas Litfin
    Yaoqi Zhou
    Genomics,Proteomics & Bioinformatics, 2019, 17 (06) : 645 - 656
  • [2] SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning
    Hanson, Jack
    Paliwal, Kuldip K.
    Litfin, Thomas
    Zhou, Yaoqi
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2019, 17 (06) : 645 - 656
  • [3] SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning
    Jack Hanson
    Kuldip KPaliwal
    Thomas Litfin
    Yaoqi Zhou
    Genomics,Proteomics & Bioinformatics, 2019, (06) : 645 - 656
  • [4] Deep learning in prediction of intrinsic disorder in proteins
    Zhao, Bi
    Kurgan, Lukasz
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 1286 - 1294
  • [5] Rapid prediction and analysis of protein intrinsic disorder
    Dayhoff, Guy W.
    Uversky, Vladimir N.
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 61A - 61A
  • [6] Critical assessment of protein intrinsic disorder prediction
    Marco Necci
    Damiano Piovesan
    Silvio C. E. Tosatto
    Nature Methods, 2021, 18 : 472 - 481
  • [7] Rapid prediction and analysis of protein intrinsic disorder
    Dayhoff, Guy W. W.
    Uversky, Vladimir N. N.
    PROTEIN SCIENCE, 2022, 31 (12)
  • [8] Critical assessment of protein intrinsic disorder prediction
    Necci, Marco
    Piovesan, Damiano
    Tosatto, Silvio C. E.
    NATURE METHODS, 2021, 18 (05) : 472 - +
  • [9] PredIDR: Accurate prediction of protein intrinsic disorder regions using deep convolutional neural network
    Han, Kun-Sop
    Song, Se-Ryong
    Pak, Myong-hyon
    Kim, Chol-Song
    Ri, Chol-Pyok
    Del Conte, Alessio
    Piovesan, Damiano
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2025, 284
  • [10] TrajBERT- DSSM: Deep bidirectional transformers for vessel trajectory understanding and destination prediction
    Zhang, Chengkai
    Bin, Junchi
    Liu, Zheng
    OCEAN ENGINEERING, 2024, 297