Mitigating the missing-fragmentation problem in de novo peptide sequencing with a two-stage graph-based deep learning model

被引:12
|
作者
Mao Z. [1 ]
Zhang R. [1 ]
Xin L. [2 ]
Li M. [1 ,3 ,4 ]
机构
[1] Cheriton School of Computer Science, University of Waterloo, Waterloo, ON
[2] Bioinformatics Solutions Inc., Waterloo, ON
[3] Henan Academy of Sciences, Henan
[4] Shanghai Institute of Mathematical and Interdisciplinary Sciences, Shanghai
基金
加拿大自然科学与工程研究理事会;
关键词
Compendex;
D O I
10.1038/s42256-023-00738-x
中图分类号
学科分类号
摘要
Novel protein discovery and immunopeptidomics depend on highly sensitive de novo peptide sequencing with tandem mass spectrometry. Despite notable improvement using deep learning models, the missing-fragmentation problem remains an important hurdle that severely degrades the performance of de novo peptide sequencing. Here we reveal that in the process of peptide prediction, missing fragmentation results in the generation of incorrect amino acids within those regions and causes error accumulation thereafter. To tackle this problem, we propose GraphNovo, a two-stage de novo peptide-sequencing algorithm based on a graph neural network. GraphNovo focuses on finding the optimal path in the first stage to guide the sequence prediction in the second stage. Our experiments demonstrate that GraphNovo mitigates the effects of missing fragmentation and outperforms the state-of-the-art de novo peptide-sequencing algorithms. © 2023, Crown.
引用
收藏
页码:1250 / 1260
页数:10
相关论文
empty
未找到相关数据