Deep neural network for detecting arbitrary precision peptide features through attention based segmentation

被引:0
作者
Fatema Tuz Zohora
M. Ziaur Rahman
Ngoc Hieu Tran
Lei Xin
Baozhen Shan
Ming Li
机构
[1] University of Waterloo,David R. Cheriton School of Computer Science
[2] Bioinformatics Solutions Inc.,undefined
来源
Scientific Reports | / 11卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
A promising technique of discovering disease biomarkers is to measure the relative protein abundance in multiple biofluid samples through liquid chromatography with tandem mass spectrometry (LC-MS/MS) based quantitative proteomics. The key step involves peptide feature detection in the LC-MS map, along with its charge and intensity. Existing heuristic algorithms suffer from inaccurate parameters and human errors. As a solution, we propose PointIso, the first point cloud based arbitrary-precision deep learning network to address this problem. It consists of attention based scanning step for segmenting the multi-isotopic pattern of 3D peptide features along with the charge, and a sequence classification step for grouping those isotopes into potential peptide features. PointIso achieves 98% detection of high-quality MS/MS identified peptide features in a benchmark dataset. Next, the model is adapted for handling the additional ‘ion mobility’ dimension and achieves 4% higher detection than existing algorithms on the human proteome dataset. Besides contributing to the proteomics study, our novel segmentation technique should serve the general object detection domain as well.
引用
收藏
相关论文
共 53 条
  • [1] LeCun Y(2015)Deep learning Nature 521 436-1941
  • [2] Bengio Y(2006)Pepper, a platform for experimental proteomic pattern recognition Mol. Cell. Proteomics 5 1927-10
  • [3] Hinton G(2016)Complete de novo assembly of monoclonal antibody sequences Sci. Rep. 6 1-66
  • [4] Jaffe JD(2019)Deep learning using tumor HLA peptide mass spectrometry datasets improves neoantigen identification Nat. Biotechnol. 37 55-771
  • [5] Tran NH(2014)A simple peak detection and label-free quantitation algorithm for chromatography-mass spectrometry BMC Bioinform. 15 376-710
  • [6] Bulik-Sullivan B(2019)Deep learning enables de novo peptide sequencing from data-independent-acquisition mass spectrometry Nat. Methods 16 63-13
  • [7] Aoshima K(2020)Personalized deep learning of individual immunopeptidomes to identify neoantigens for cancer vaccines Nat. Mach. Intell. 2 764-8
  • [8] Tran NH(2020)Improved protein structure prediction using potentials from deep learning Nature 577 706-687
  • [9] Tran NH(2019)Deepiso: A deep learning model for peptide feature detection from LC-MS map Sci. Rep. 9 1-3773
  • [10] Senior AW(2003)Mass spectrometry-based proteomics Nature 422 198-1372