Deep Learning Prediction Boosts Phosphoproteomics-Based Discoveries Through Improved Phosphopeptide Identification

Cited by: 6
Authors
Yi, Xinpei [1, 2, 6]
Wen, Bo [1, 2]
Ji, Shuyi [3, 4]
Saltzman, Alexander B. [5]
Jaehnig, Eric J. [1, 2]
Lei, Jonathan T. [1, 2]
Gao, Qiang [3, 4]
Zhang, Bing [1, 2]
Affiliations
[1] Baylor Coll Med, Lester & Sue Smith Breast Ctr, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[3] Zhongshan Hosp, Liver Canc Inst, Dept Liver Surg & Transplantat, Shanghai, Peoples R China
[4] Fudan Univ, Key Lab Carcinogenesis & Canc Invas, Minist China, Shanghai, Peoples R China
[5] Baylor Coll Med, Adv Technol Cores, Mass Spectrometry Prote Core, Houston, TX USA
[6] Shanghai Jiao Tong Univ, Sch Life Sci & Biotechnol, Dept Bioinformat & Biostat, Shanghai 200240, Peoples R China
Keywords
PEPTIDE IDENTIFICATION; PROTEOGENOMIC CHARACTERIZATION; PROTEIN-PHOSPHORYLATION; IN-VIVO; SITE; TANDEM; LOCALIZATION; RICH
DOI
10.1016/j.mcpro.2023.100707
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples. One of the primary challenges associated with this technology is the relatively low rate of phosphopeptide identification during data analysis. This limitation hampers the full realization of the potential offered by shotgun phosphoproteomics. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19% to 46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
Pages: 17