Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies

被引:1
作者
Wu, Tzu-Hsuan [1 ]
Lin, Peng-Chan [2 ]
Chou, Hsin-Hung [3 ]
Shen, Meng-Ru [4 ]
Hsieh, Sun-Yuan [1 ,5 ]
机构
[1] Natl Cheng Kung Univ, Inst Med Informat, Tainan 701, Taiwan
[2] Natl Cheng Kung Univ Hosp, Dept Comp Sci & Informat Engn, Dept Internal Med, Tainan 704, Taiwan
[3] Natl Chi Nan Univ, Dept Comp Sci & Informat Engn, Puli Township 54516, Nantou County, Taiwan
[4] Natl Cheng Kung Univ, Dept Obstet & Gynecol, Dept Pharmacol, Coll Med, Tainan 701, Taiwan
[5] Natl Cheng Kung Univ, Inst Mfg Informat Syst, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
关键词
Machine learning; pathogenicity prediction; protein structure energy; single amino acid variants; SNP; MUTATIONS; POLYMORPHISMS;
D O I
10.1109/TCBB.2021.3139048
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The most popular tools for predicting pathogenicity of single amino acid variants (SAVs) were developed based on sequence-based techniques. SAVs may change protein structure and function. In the context of van derWaals force and disulfide bridge calculations, no method directly predicts the impact of mutations on the energies of the protein structure. Here, we combined machine learning methods and energy scores of protein structures calculated by Rosetta Energy Function 2015 to predict SAV pathogenicity. The accuracy level of our model (0.76) is higher than that of six prediction tools. Further analyses revealed that the differential reference energies, attractive energies, and solvation of polar atoms between wildtype and mutant side-chains played essential roles in distinguishing benign from pathogenic variants. These features indicated the physicochemical properties of amino acids, which were observed in 3D structures instead of sequences. We added 16 features to Rhapsody (the prediction tool we used for our data set) and consequently improved its performance. The results indicated that these energy scores were more appropriate and more detailed representations of the pathogenicity of SAVs.
引用
收藏
页码:606 / 615
页数:10
相关论文
共 50 条
  • [31] Prognosis Prediction of Stroke based on Machine Learning and Explanation Model
    Qin, Qiuli
    Zhou, Xuehan
    Jiang, Yong
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2021, 16 (02) : 1 - 13
  • [32] Study on Machine Learning based Heart Disease Prediction Model
    Zhang, Shihan
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 346 - 352
  • [33] Machine learning-based model for prediction of concrete strength
    Aswal, Vivek Singh
    Singh, B. K.
    Maheshwari, Rohit
    MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2025, 8 (01)
  • [34] Multivariate prediction model of geothermal parameters based on machine learning
    Zheng, Shuang-Fei
    Li, Xu
    Wang, Meng
    ENERGY, 2025, 316
  • [35] Construction of the prediction model for multiple myeloma based on machine learning
    Cai, Jiangying
    Liu, Zhenhua
    Wang, Yingying
    Yang, Wanxia
    Sun, Zhipeng
    You, Chongge
    INTERNATIONAL JOURNAL OF LABORATORY HEMATOLOGY, 2024, 46 (05) : 918 - 926
  • [36] Gene-specific machine learning for pathogenicity prediction of rare BRCA1 and BRCA2 missense variants
    Kang, Moonjong
    Kim, Seonhwa
    Lee, Da-Bin
    Hong, Changbum
    Hwang, Kyu-Baek
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [37] Risk score prediction model based on single nucleotide polymorphism for predicting malaria: a machine learning approach
    Tai, Kah Yee
    Dhaliwal, Jasbir
    Wong, KokSheik
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [38] Risk score prediction model based on single nucleotide polymorphism for predicting malaria: a machine learning approach
    Kah Yee Tai
    Jasbir Dhaliwal
    KokSheik Wong
    BMC Bioinformatics, 23
  • [39] An exploration on the machine-learning-based stroke prediction model
    Zhi, Shenshen
    Hu, Xiefei
    Ding, Yan
    Chen, Huajian
    Li, Xun
    Tao, Yang
    Li, Wei
    FRONTIERS IN NEUROLOGY, 2024, 15
  • [40] BTRPP: A Rapid PGA Prediction Model Based on Machine Learning
    Ren, Tao
    Wang, Pengyu
    Chen, Hongfeng
    Liu, Xinliang
    Meng, Fanchun
    Ma, Yanlu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61