PGPointNovo: an efficient neural network-based tool for parallel de novo peptide sequencing

被引:3
|
作者
Xu, Xiaofang [1 ]
Yang, Chunde [1 ]
He, Qiang [3 ]
Shu, Kunxian [5 ]
Xinpu, Yuan [6 ]
Chen, Zhiguang [4 ]
Zhu, Yunping [2 ]
Chen, Tao [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
[2] Beijing Inst Life, Beijing Proteome Res Ctr, Natl Ctr Prot Sci Beijing, State Key Lab Prote, Beijing 102206, Peoples R China
[3] Swinburne Univ Technol, Sch Software & Elect Engn, Melbourne, Vic 3122, Australia
[4] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 26469, Peoples R China
[5] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Big Data Bio Intelligence, Chongqing 400065, Peoples R China
[6] Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Gen Surg, Beijing, Peoples R China
来源
BIOINFORMATICS ADVANCES | 2023年 / 3卷 / 01期
关键词
CANCER; NUMBER; VALIDATION; CLUSTERS;
D O I
10.1093/bioadv/vbad057
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
De novo peptide sequencing for tandem mass spectrometry data is not only a key technology for novel peptide identification, but also a precedent task for many downstream tasks, such as vaccine and antibody studies. In recent years, neural network models for de novo peptide sequencing have manifested a remarkable ability to accommodate various data sources and outperformed conventional peptide identification tools. However, the excellent model is computationally expensive, taking up to 1 week to process about 400 000 spectrums. This article presents PGPointNovo, a novel neural network-based tool for parallel de novo peptide sequencing. PGPointNovo uses data parallelization technology to accelerate training and inference and optimizes the training obstacles caused by large batch sizes. The results of extensive experiments conducted on multiple datasets of different sizes demonstrate that compared with PointNovo the excellent neural network-based de novo peptide sequencing tool, PGPointNovo, accelerates de novo peptide sequencing by up to 7.35x without precision or recall compromises.
引用
收藏
页数:3
相关论文
共 50 条
  • [41] NovoRank: Refinement for De Novo Peptide Sequencing Based on Spectral Clustering and Deep Learning
    Seo, Jangho
    Choi, Seunghyuk
    Paek, Eunok
    JOURNAL OF PROTEOME RESEARCH, 2024, 24 (02) : 903 - 910
  • [42] An effective method for de novo peptide sequencing based on phosphorylation strategy and mass spectrometry
    Zhang, Dongmei
    Liu, Hongxia
    Zhang, Shusheng
    Chen, Xiaolan
    Li, Shangfu
    Zhang, Cunlong
    Hu, Xiangming
    Bi, Kaishun
    Chen, Xiaohui
    Jiang, Yuyang
    TALANTA, 2011, 84 (03) : 614 - 622
  • [43] Bidirectional de novo peptide sequencing using a transformer model
    Lee, Sangjeong
    Kim, Hyunwoo
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (02)
  • [44] NovoHCD: De novo Peptide Sequencing From HCD Spectra
    Yan, Yan
    Kusalik, Anthony J.
    Wu, Fang-Xiang
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (02) : 65 - 72
  • [45] Probabilistic de novo peptide sequencing with doubly charged ions
    Peter, Hansruedi
    Fischer, Bernd
    Buhmann, Joachim M.
    PATTERN RECOGNITION, PROCEEDINGS, 2006, 4174 : 424 - 433
  • [46] NovoExD: De novo Peptide Sequencing for ETD/ECD Spectra
    Yan, Yan
    Kusalik, Anthony J.
    Wu, Fang-Xiang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (02) : 337 - 344
  • [47] De Novo Mass Spectrometry Peptide Sequencing with a Transformer Model
    Yilmaz, Melih
    Fondrie, William E.
    Bittremieux, Wout
    Oh, Sewoong
    Noble, William Stafford
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [48] NovoHMM: A hidden Markov model for de novo peptide sequencing
    Fischer, B
    Roth, V
    Roos, F
    Grossmann, J
    Baginsky, S
    Widmayer, P
    Gruissem, W
    Buhmann, JM
    ANALYTICAL CHEMISTRY, 2005, 77 (22) : 7265 - 7273
  • [49] LESSONS IN DE NOVO PEPTIDE SEQUENCING BY TANDEM MASS SPECTROMETRY
    Medzihradszky, Katalin F.
    Chalkley, Robert J.
    MASS SPECTROMETRY REVIEWS, 2015, 34 (01) : 43 - 63
  • [50] Tandem mass intensity estimation for de novo peptide sequencing
    Loukil, Hatem
    2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2018, : 91 - 96