Deep Learning Prediction Boosts Phosphoproteomics-Based Discoveries Through Improved Phosphopeptide Identification

被引:6
|
作者
Yi, Xinpei [1 ,2 ,6 ]
Wen, Bo [1 ,2 ]
Ji, Shuyi [3 ,4 ]
Saltzman, Alexander B. [5 ]
Jaehnig, Eric J. [1 ,2 ]
Lei, Jonathan T. [1 ,2 ]
Gao, Qiang [3 ,4 ]
Zhang, Bing [1 ,2 ]
机构
[1] Baylor Coll Med, Lester & Sue Smith Breast Ctr, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[3] Zhongshan Hosp, Liver Canc Inst, Dept Liver Surg & Transplantat, Shanghai, Peoples R China
[4] Fudan Univ, Key Lab Carcinogenesis & Canc Invas, Minist China, Shanghai, Peoples R China
[5] Baylor Coll Med, Adv Technol Cores, Mass Spectrometry Prote Core, Houston, TX USA
[6] Shanghai Jiao Tong Univ, Sch Life Sci & Biotechnol, Dept Bioinformat & Biostat, Shanghai 200240, Peoples R China
关键词
PEPTIDE IDENTIFICATION; PROTEOGENOMIC CHARACTERIZATION; PROTEIN-PHOSPHORYLATION; IN-VIVO; SITE; TANDEM; LOCALIZATION; RICH;
D O I
10.1016/j.mcpro.2023.100707
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples. One of the primary challenges associated with this technology is the relatively low rate of phosphopeptide identification during data analysis. This limitation hampers the full realization of the potential offered by shotgun phosphoproteomics. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19% to 46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
引用
收藏
页数:17
相关论文
共 37 条
  • [31] Identification of Flow Pressure-Driven Leakage Zones Using Improved EDNN-PP-LCNetV2 with Deep Learning Framework in Water Distribution System
    Dong, Bo
    Shu, Shihu
    Li, Dengxin
    PROCESSES, 2024, 12 (09)
  • [32] Data-driven algorithm based on the scaled boundary finite element method and deep learning for the identification of multiple cracks in massive structures
    Jiang, Shouyan
    Deng, Wangtao
    Ooi, Ean Tat
    Sun, Liguo
    Du, Chengbin
    COMPUTERS & STRUCTURES, 2024, 291
  • [33] Tracking multiple construction workers through deep learning and the gradient based method with re-matching based on multi-object tracking accuracy
    Angah, Ohay
    Chen, Albert Y.
    AUTOMATION IN CONSTRUCTION, 2020, 119
  • [34] Identification of the Shaft-Rate Electromagnetic Field Induced by a Moving Ship Using Improved Learning-Based and Spectral-Direction Methods
    Hu, Shuanggui
    Zhang, Liang
    Tang, Jingtian
    Li, Guang
    Yang, Haiyan
    Xu, Zhenhuan
    Zhang, Lincheng
    Xiang, Jingnian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] Deep Learning-Based Location Spoofing Attack Detection and Time-of-Arrival Estimation through Power Received in IoT Networks
    Aldosari, Waleed
    SENSORS, 2023, 23 (23)
  • [36] A Deep Learning-Based Structural Damage Identification Method Integrating CNN-BiLSTM-Attention for Multi-Order Frequency Data Analysis
    Pei, Xue-Yang
    Hou, Yuan
    Huang, Hai-Bin
    Zheng, Jun-Xing
    BUILDINGS, 2025, 15 (05)
  • [37] PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection
    Ullah, Matee
    Han, Ke
    Hadi, Fazal
    Xu, Jian
    Song, Jiangning
    Yu, Dong-Jun
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)