AI-Driven Deep Learning Techniques in Protein Structure Prediction

Cited by: 31
Authors
Chen, Lingtao [1 ]
Li, Qiaomu [1 ]
Nasif, Kazi Fahim Ahmad [1 ]
Xie, Ying [1 ]
Deng, Bobin [1 ]
Niu, Shuteng [2 ]
Pouriyeh, Seyedamin [1 ]
Dai, Zhiyu [3 ]
Chen, Jiawei [4 ]
Xie, Chloe Yixin [1 ]
Affiliations
[1] Kennesaw State Univ, Coll Comp & Software Engn, Marietta, GA 30060 USA
[2] Bowling Green State Univ, Dept Comp Sci, Bowling Green, OH 43403 USA
[3] Washington Univ, John T Milliken Dept Med, Div Pulm & Crit Care Med, Sch Med St Louis, St Louis, MO 63110 USA
[4] Univ Calif Berkeley, Div Comp Data Sci & Soc, Berkeley, CA 94720 USA
Keywords
protein structure; computational methods; artificial intelligence; machine learning; deep learning; transformer; AlphaFold; protein modeling; bioinformatics; healthcare; META-THREADING-SERVER; HOMOLOGY DETECTION; FOLD-RECOGNITION; SWISS-MODEL; GENERATION; ALIGNMENT; ACCURATE; TARGETS; QUALITY; BIOLOGY;
DOI
10.3390/ijms25158426
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Predicting protein structure is important for understanding protein function and behavior. This review surveys the computational models used to predict protein structure, tracing the progression from established protein modeling to state-of-the-art artificial intelligence (AI) frameworks. The paper starts with a brief introduction to protein structures, protein modeling, and AI. The section on established protein modeling discusses homology modeling, ab initio modeling, and threading. The next section covers deep learning-based models and introduces state-of-the-art AI models such as AlphaFold (AlphaFold, AlphaFold2, AlphaFold3), RoseTTAFold, and ProteinBERT. It also discusses how AI techniques have been integrated into established frameworks like Swiss-Model, Rosetta, and I-TASSER. Model performance is compared using the rankings of CASP14 (Critical Assessment of Structure Prediction) and CASP15; CASP16 is ongoing, and its results are not included in this review. Continuous Automated Model EvaluatiOn (CAMEO), which complements the biennial CASP experiment, is covered as well, along with three evaluation metrics: the template modeling score (TM-score), the global distance test total score (GDT_TS), and the Local Distance Difference Test (lDDT) score. The paper then acknowledges the ongoing difficulties in predicting protein structure and emphasizes the need for further research on dynamic protein behavior, conformational changes, and protein-protein interactions. The application section introduces uses in fields such as drug design, industry, education, and novel protein development. In summary, the paper provides an overview of the latest advancements in established protein modeling and deep learning-based models for protein structure prediction, highlights the significant progress achieved by AI, and identifies potential areas for further investigation.
Pages: 21