Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies

被引:1
|
作者
Wu, Tzu-Hsuan [1 ]
Lin, Peng-Chan [2 ]
Chou, Hsin-Hung [3 ]
Shen, Meng-Ru [4 ]
Hsieh, Sun-Yuan [1 ,5 ]
机构
[1] Natl Cheng Kung Univ, Inst Med Informat, Tainan 701, Taiwan
[2] Natl Cheng Kung Univ Hosp, Dept Comp Sci & Informat Engn, Dept Internal Med, Tainan 704, Taiwan
[3] Natl Chi Nan Univ, Dept Comp Sci & Informat Engn, Puli Township 54516, Nantou County, Taiwan
[4] Natl Cheng Kung Univ, Dept Obstet & Gynecol, Dept Pharmacol, Coll Med, Tainan 701, Taiwan
[5] Natl Cheng Kung Univ, Inst Mfg Informat Syst, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
关键词
Machine learning; pathogenicity prediction; protein structure energy; single amino acid variants; SNP; MUTATIONS; POLYMORPHISMS;
D O I
10.1109/TCBB.2021.3139048
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The most popular tools for predicting pathogenicity of single amino acid variants (SAVs) were developed based on sequence-based techniques. SAVs may change protein structure and function. In the context of van derWaals force and disulfide bridge calculations, no method directly predicts the impact of mutations on the energies of the protein structure. Here, we combined machine learning methods and energy scores of protein structures calculated by Rosetta Energy Function 2015 to predict SAV pathogenicity. The accuracy level of our model (0.76) is higher than that of six prediction tools. Further analyses revealed that the differential reference energies, attractive energies, and solvation of polar atoms between wildtype and mutant side-chains played essential roles in distinguishing benign from pathogenic variants. These features indicated the physicochemical properties of amino acids, which were observed in 3D structures instead of sequences. We added 16 features to Rhapsody (the prediction tool we used for our data set) and consequently improved its performance. The results indicated that these energy scores were more appropriate and more detailed representations of the pathogenicity of SAVs.
引用
收藏
页码:606 / 615
页数:10
相关论文
共 50 条
  • [21] Amino Acid k-mer Feature Extraction for Quantitative Antimicrobial Resistance (AMR) Prediction by Machine Learning and Model Interpretation for Biological Insights
    ValizadehAslani, Taha
    Zhao, Zhengqiao
    Sokhansanj, Bahrad A.
    Rosen, Gail L.
    BIOLOGY-BASEL, 2020, 9 (11): : 1 - 92
  • [22] Advances in Protein Contact Map Prediction Based on Machine Learning
    Xie, Jiang
    Ding, Wang
    Chen, Luonan
    Guo, Qiang
    Zhang, Wu
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 265 - 270
  • [23] Prediction of recombinant protein overexpression in Escherichia coli using a machine learning based model (RPOLP)
    Habibi, Narjeskhatoon
    Norouzi, Alireza
    Hashim, Siti Z. Mohd
    Shamsir, Mohd Shahir
    Samian, Razip
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 66 : 330 - 336
  • [24] A machine learning approach for prediction of Lipid-Interacting residues in amino acid sequences
    Irausquin, Stephanie Jimenez
    Wang, Liangjiang
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 315 - 319
  • [25] Machine learning based prediction model for plastic hinge length calculation of reinforced concrete structural walls
    Barkhordari, Mohammad Sadegh
    Jawdhari, Akram
    ADVANCES IN STRUCTURAL ENGINEERING, 2023, 26 (09) : 1714 - 1734
  • [26] Linking Protein Stability to Pathogenicity: Predicting Clinical Significance of Single-Missense Mutations in Ocular Proteins Using Machine Learning
    Majid, Iyad
    Sergeev, Yuri V.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (21)
  • [27] Machine learning-based fatigue lifetime prediction of structural steels
    Arvanitis, Konstantinos
    Nikolakopoulos, Pantelis
    Pavlou, Dimitrios
    Farmanbar, Mina
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 125 : 55 - 66
  • [28] AnoChem: Prediction of chemical structural abnormalities based on machine learning models
    Gu, Changdai
    Jang, Woo Dae
    Oh, Kwang-Seok
    Ryu, Jae Yong
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 2116 - 2121
  • [29] Enhancing Missense Variant Pathogenicity Prediction with MissenseNet: Integrating Structural Insights and ShuffleNet-Based Deep Learning Techniques
    Liu, Jing
    Chen, Yingying
    Huang, Kai
    Guan, Xiao
    BIOMOLECULES, 2024, 14 (09)
  • [30] Development of Patent Technology Prediction Model Based on Machine Learning
    Lee, Chih-Wei
    Tao, Feng
    Ma, Yu-Yu
    Lin, Hung-Lung
    AXIOMS, 2022, 11 (06)