MutBLESS: A tool to identify disease-prone sites in cancer using deep learning

被引:1
|
作者
Pandey, Medha [1 ]
Gromiha, M. Michael [1 ]
机构
[1] Indian Inst Technol Madras, Bhupat & Jyoti Mehta Sch Biosci, Dept Biotechnol, Chennai 600036, India
来源
BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR BASIS OF DISEASE | 2023年 / 1869卷 / 06期
关键词
Deep neural networks; Breast cancer; Algorithms; Motifs; Hotspots; HOTSPOT MUTATIONS; PREDICTION; GENE; PATHOGENICITY; DATABASE; SEARCH;
D O I
10.1016/j.bbadis.2023.166721
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the molecular basis and impact of mutations at different stages of cancer are long-standing challenges in cancer biology. Identification of driver mutations from experiments is expensive and time intensive. In the present study, we collected the data for experimentally known driver mutations in 22 different cancer types and classified them into six categories: breast cancer (BRCA), acute myeloid leukaemia (LAML), endometrial carcinoma (EC), stomach cancer (STAD), skin cancer (SKCM), and other cancer types which contains 5747 disease prone and 5514 neutral sites in 516 proteins. The analysis of amino acid distribution along mutant sites revealed that the motifs AAA and LR are preferred in disease-prone sites whereas QPP and QF are dominant in neutral sites. Further, we developed a method using deep neural networks to predict disease-prone sites with amino acid sequence-based features such as physicochemical properties, secondary structure, tri-peptide motifs and conservation scores. We obtained an average AUC of 0.97 in five cancer types BRCA, LAML, EC, STAD and SKCM in a test dataset and 0.72 in all other cancer types together. Our method showed excellent performance for identifying cancer-specific mutations with an average sensitivity, specificity, and accuracy of 96.56 %, 97.39 %, and 97.64 %, respectively. We developed a web server for identifying cancer-prone sites, and it is available at htt ps://web.iitm.ac.in/bioinfo2/MutBLESS/index.html. We suggest that our method can serve as an effective method to identify disease-prone sites and assist to develop therapeutic strategies.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] CHURCH GENEALOGY RECORDS MAY IDENTIFY DISEASE-PRONE PEOPLE
    GLITZ, W
    COMPUTERS AND PEOPLE, 1977, 26 (03): : 24 - 25
  • [2] Identification of heart disease-prone personality using oscillometric blood pressure measurements
    Jung, Seungah
    Shin, Youngsuk
    TECHNOLOGY AND HEALTH CARE, 2015, 23 : S211 - S222
  • [3] The histopathological features of the surgical endometriosis model using systemic autoimmune disease-prone mice
    Hosotani, Marina
    Akita, Machiko
    Ueda, Hiromi
    Watanabe, Takafumi
    JOURNAL OF VETERINARY MEDICAL SCIENCE, 2023, 85 (01): : 1 - 8
  • [4] Leveraging Protein Dynamics to Identify Functional Phosphorylation Sites using Deep Learning Models
    Zhu, Fei
    Yang, Sijie
    Meng, Fanwang
    Zheng, Yuxiang
    Ku, Xin
    Luo, Cheng
    Hu, Guang
    Liang, Zhongjie
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (14) : 3331 - 3345
  • [5] Using Deep Learning for MRI to Identify Responders to Chemoradiotherapy in Rectal Cancer
    Koh, Dow-Mu
    RADIOLOGY, 2020, 296 (01) : 65 - 66
  • [6] Investigating the application of deep learning to identify pedestrian collision-prone zones
    Ghomi, Haniyeh
    Hussein, Mohamed
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (11) : 1172 - 1202
  • [7] GENERALIZATION OF PROSTATE CANCER CLASSIFICATION FOR MULTIPLE SITES USING DEEP LEARNING
    Arvidsson, Ida
    Overgaard, Niels Christian
    Marginean, Felicia-Elena
    Krzyzanowska, Agnieszka
    Bjartell, Anders
    Astrom, Kalle
    Heyden, Anders
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 191 - 194
  • [8] Incorporating Deep Learning With Word Embedding to Identify Plant Ubiquitylation Sites
    Wang, Hongfei
    Wang, Zhuo
    Li, Zhongyan
    Lee, Tzong-Yi
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2020, 8
  • [9] Deep Learning to Identify Transcription Start Sites from CAGE Data
    Zheng, Hansi
    Li, Xiaoman
    Hu, Haiyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 168 - 172
  • [10] Can Deep Learning Identify Tomato Leaf Disease?
    Zhang, Keke
    Wu, Qiufeng
    Liu, Anwang
    Meng, Xiangyan
    ADVANCES IN MULTIMEDIA, 2018, 2018