Deep neural network for detecting arbitrary precision peptide features through attention based segmentation
被引:6
|
作者:
Zohora, Fatema Tuz
论文数: 0引用数: 0
h-index: 0
机构:
Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Zohora, Fatema Tuz
[1
]
Rahman, M. Ziaur
论文数: 0引用数: 0
h-index: 0
机构:
Bioinformat Solut Inc, Waterloo, ON N2L 6J2, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Rahman, M. Ziaur
[2
]
Tran, Ngoc Hieu
论文数: 0引用数: 0
h-index: 0
机构:
Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Tran, Ngoc Hieu
[1
]
Xin, Lei
论文数: 0引用数: 0
h-index: 0
机构:
Bioinformat Solut Inc, Waterloo, ON N2L 6J2, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Xin, Lei
[2
]
Shan, Baozhen
论文数: 0引用数: 0
h-index: 0
机构:
Bioinformat Solut Inc, Waterloo, ON N2L 6J2, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Shan, Baozhen
[2
]
Li, Ming
论文数: 0引用数: 0
h-index: 0
机构:
Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, CanadaUniv Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Li, Ming
[1
]
机构:
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] Bioinformat Solut Inc, Waterloo, ON N2L 6J2, Canada
A promising technique of discovering disease biomarkers is to measure the relative protein abundance in multiple biofluid samples through liquid chromatography with tandem mass spectrometry (LC-MS/MS) based quantitative proteomics. The key step involves peptide feature detection in the LC-MS map, along with its charge and intensity. Existing heuristic algorithms suffer from inaccurate parameters and human errors. As a solution, we propose PointIso, the first point cloud based arbitrary-precision deep learning network to address this problem. It consists of attention based scanning step for segmenting the multi-isotopic pattern of 3D peptide features along with the charge, and a sequence classification step for grouping those isotopes into potential peptide features. PointIso achieves 98% detection of high-quality MS/MS identified peptide features in a benchmark dataset. Next, the model is adapted for handling the additional 'ion mobility' dimension and achieves 4% higher detection than existing algorithms on the human proteome dataset. Besides contributing to the proteomics study, our novel segmentation technique should serve the general object detection domain as well.