A mass graph-based approach for the identification of modified proteoforms using top-down tandem mass spectra

被引:30
|
作者
Kou, Qiang [1 ]
Wu, Si [2 ]
Tolic, Nikola [3 ]
Pasa-Tolic, Ljiljana [3 ]
Liu, Yunlong [4 ,5 ]
Liu, Xiaowen [1 ,5 ]
机构
[1] Indiana Univ Purdue Univ, Dept BioHlth Informat, Indianapolis, IN 46202 USA
[2] Univ Oklahoma, Dept Chem & Biochem, Norman, OK 73019 USA
[3] Pacific Northwest Natl Lab, Environm Mol Sci Lab, Richland, WA 99354 USA
[4] Indiana Univ Sch Med, Dept Med & Mol Genet, Indianapolis, IN 46202 USA
[5] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
基金
美国国家卫生研究院;
关键词
PROTEIN IDENTIFICATION; POSTTRANSLATIONAL MODIFICATIONS; SITE LOCALIZATION; SPECTROMETRY; ALIGNMENT; PEPTIDES; SEARCH;
D O I
10.1093/bioinformatics/btw806
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Although proteomics has rapidly developed in the past decade, researchers are still in the early stage of exploring the world of complex proteoforms, which are protein products with various primary structure alterations resulting from gene mutations, alternative splicing, post-translational modifications, and other biological processes. Proteoform identification is essential to mapping proteoforms to their biological functions as well as discovering novel proteoforms and new protein functions. Top-down mass spectrometry is the method of choice for identifying complex proteoforms because it provides a 'bird's eye view' of intact proteoforms. The combinatorial explosion of various alterations on a protein may result in billions of possible proteoforms, making proteoform identification a challenging computational problem. Results: We propose a new data structure, called the mass graph, for efficient representation of proteoforms and design mass graph alignment algorithms. We developed TopMG, a mass graph-based software tool for proteoform identification by top-down mass spectrometry. Experiments on top-down mass spectrometry datasets showed that TopMG outperformed existing methods in identifying complex proteoforms.
引用
收藏
页码:1309 / 1316
页数:8
相关论文
共 50 条
  • [21] De Novo Sequencing of Peptides from High-Resolution Bottom-Up Tandem Mass Spectra using Top-Down Intended Methods
    Vyatkina, Kira
    Dekker, Lennard J. M.
    Wu, Si
    VanDuijn, Martijn M.
    Liu, Xiaowen
    Tolic, Nikola
    Luider, Theo M.
    Pasa-Tolic, Ljiljana
    PROTEOMICS, 2017, 17 (23-24)
  • [22] Does deamidation cause protein unfolding? A top-down tandem mass spectrometry study
    Soulby, Andrew J.
    Heal, Jack W.
    Barrow, Mark P.
    Roemer, Rudolf A.
    O'Connor, Peter B.
    PROTEIN SCIENCE, 2015, 24 (05) : 850 - 860
  • [23] Proteoform Identification by Combining RNA-Seq and Top-Down Mass Spectrometry
    Chen, Wenrong
    Liu, Xiaowen
    JOURNAL OF PROTEOME RESEARCH, 2021, 20 (01) : 261 - 269
  • [24] Characterization of Human Sperm Protamine Proteoforms through a Combination of Top-Down and Bottom-Up Mass Spectrometry Approaches
    Soler-Ventura, Ada
    Gay, Marina
    Jodar, Meritxell
    Vilanova, Mar
    Castillo, Judit
    Arauz-Garofalo, Gianluca
    Villarreal, Laura
    Ballesca, Josep Lluis
    Vilaseca, Marta
    Oliva, Rafael
    JOURNAL OF PROTEOME RESEARCH, 2020, 19 (01) : 221 - 237
  • [25] Top-Down Characterization of Heavily Modified Histones Using 193 nm Ultraviolet Photodissociation Mass Spectrometry
    Greer, Sylvester M.
    Brodbelt, Jennifer S.
    JOURNAL OF PROTEOME RESEARCH, 2018, 17 (03) : 1138 - 1145
  • [26] Systematic Evaluation of Protein Sequence Filtering Algorithms for Proteoform Identification Using Top-Down Mass Spectrometry
    Kou, Qiang
    Wu, Si
    Liu, Xiaowen
    PROTEOMICS, 2018, 18 (3-4)
  • [27] Quantitative Top-Down Proteomics by Isobaric Labeling with Thiol-Directed Tandem Mass Tags
    Winkels, Konrad
    Koudelka, Tomas
    Tholey, Andreas
    JOURNAL OF PROTEOME RESEARCH, 2021, 20 (09) : 4495 - 4506
  • [28] The Current and Future State of Top-Down Protein Sequencing by ESI-Tandem Mass Spectrometry
    Loo, J. A.
    Krull, I. S.
    Rathore, A.
    LC GC NORTH AMERICA, 2016, 34 (07) : 492 - 499
  • [29] Combining SDS-PAGE to capillary zone electrophoresis-tandem mass spectrometry for high-resolution top-down proteomics analysis of intact histone proteoforms
    Fang, Fei
    Gao, Guangyao
    Wang, Qianyi
    Wang, Qianjie
    Sun, Liangliang
    PROTEOMICS, 2024, 24 (17)
  • [30] New Algorithm for the Identification of Intact Disulfide Linkages Based on Fragmentation Characteristics in Tandem Mass Spectra
    Choi, Seonhwa
    Jeong, Jaeho
    Na, Seungjin
    Lee, Hyo Sun
    Kim, Hwa-Young
    Lee, Kong-Joo
    Paek, Eunok
    JOURNAL OF PROTEOME RESEARCH, 2010, 9 (01) : 626 - 635