Genomic variability and protein species - Improving sequence coverage for proteogenomics

被引:7
作者
Bischoff, Rainer [1 ]
Permentier, Hjalmar [1 ,2 ]
Guryev, Victor [3 ]
Horvatovich, Peter [1 ]
机构
[1] Univ Groningen, Univ Med Ctr Groningen, Res Inst Pharm, Dept Analyt Biochem, Antonius Deusinglaan 1, NL-9713 AV Groningen, Netherlands
[2] Univ Groningen, Univ Med Ctr Groningen, Res Inst Pharm, Interfac Mass Spectrometry Ctr, Antonius Deusinglaan 1, NL-9713 AV Groningen, Netherlands
[3] Univ Groningen, Univ Med Ctr Groningen, European Res Inst Biol Ageing, Antonius Deusinglaan 1, NL-9713 AV Groningen, Netherlands
关键词
Genomics; Proteomics; Mass spectrometry; Proteogenomics; DNA/RNA sequencing; COLLISION-INDUCED DISSOCIATION; ELECTRON-TRANSFER DISSOCIATION; MASS-SPECTROMETRY; ELECTROCHEMICAL OXIDATION; PEPTIDE IDENTIFICATION; LIQUID-CHROMATOGRAPHY; LOW-ENERGY; CHARGE DERIVATIZATION; PROTEOMICS; SPECTRA;
D O I
10.1016/j.jprot.2015.09.021
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein heterogeneity may result from many factors often closely related to the regulation of biological mechanisms. This review addresses one source of protein heterogeneity, the translation of genetic variability and transcriptional modulation to the protein level. We provide an overview how customized protein sequence databases generated using genomic and transcriptomic sequence information in conjunction with approaches to increase protein sequence coverage can aid in gaining a deeper insight into variability at the protein level. Modem approaches of DNA/RNA sequencing open the possibility to obtain detailed sequence information from individual genomes and transcriptomes at single nucleotide resolution. Further studies tried to correlate genetic variability with important biological consequences such as the risk for developing a disease or defining a personalized approach towards therapy (also called "personalized or precision medicine"). Linking genomic and transcriptomic information to complex biological mechanisms has, however, remained elusive due to the fact that there is no direct cause and effect relationship between changes at the DNA/RNA level and downstream consequences. In this review we give an overview of the challenges of integrating genomics and transcriptomics data with proteomics data and link variability at the DNA/RNA level to protein variability and protein species. Biological significance: The manuscript focuses on a recent trend in proteomics, namely the integration of genomic and proteomic data. Genetic and transcriptomic variability accounts for a considerable part of protein variability and is at the basis of many protein species, many of which not yet described at the protein level but many also identified as proteins or peptides with unknown function. The review highlights the challenges of current proteomics methodology, notably incomplete sequence coverage, which make it difficult to appreciate the full complexity of any proteome and leads to the fact that much variability at the DNA/RNA level is not captured at the protein level. We outline a few strategies to ameliorate this situation. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:25 / 36
页数:12
相关论文
共 95 条
[1]   The genome sequence of the spontaneously hypertensive rat: Analysis and functional significance [J].
Atanur, Santosh S. ;
Birol, Inanc ;
Guryev, Victor ;
Hirst, Martin ;
Hummel, Oliver ;
Morrissey, Catherine ;
Behmoaras, Jacques ;
Fernandez-Suarez, Xose M. ;
Johnson, Michelle D. ;
McLaren, William M. ;
Patone, Giannino ;
Petretto, Enrico ;
Plessy, Charles ;
Rockland, Kathleen S. ;
Rockland, Charles ;
Saar, Kathrin ;
Zhao, Yongjun ;
Carninci, Piero ;
Flicek, Paul ;
Kurtz, Ted ;
Cuppen, Edwin ;
Pravenec, Michal ;
Hubner, Norbert ;
Jones, Steven J. M. ;
Birney, Ewan ;
Aitman, Timothy J. .
GENOME RESEARCH, 2010, 20 (06) :791-803
[2]   Characterization of the human ESC transcriptome by hybrid sequencing [J].
Au, Kin Fai ;
Sebastiano, Vittorio ;
Afshar, Pegah Tootoonchi ;
Durruthy, Jens Durruthy ;
Lee, Lawrence ;
Williams, Brian A. ;
van Bakel, Harm ;
Schadt, Eric E. ;
Reijo-Pera, Renee A. ;
Underwood, Jason G. ;
Wong, Wing Hung .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (50) :E4821-E4830
[3]   Predictions of peptides' retention times in reversed-phase liquid chromatography as a new supportive tool to improve protein identification in proteomics [J].
Baczek, Tomasz ;
Kaliszan, Roman .
PROTEOMICS, 2009, 9 (04) :835-847
[4]   Gene Expression Regulation by Upstream Open Reading Frames and Human Disease [J].
Barbosa, Cristina ;
Peixeiro, Isabel ;
Romao, Luisa .
PLOS GENETICS, 2013, 9 (08)
[5]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[6]   Processing strategies and software solutions for data-independent acquisition in mass spectrometry [J].
Bilbao, Aivett ;
Varesio, Emmanuel ;
Luban, Jeremy ;
Strambio-De-Castillia, Caterina ;
Hopfgartner, Gerard ;
Mueller, Markus ;
Lisacek, Frederique .
PROTEOMICS, 2015, 15 (5-6) :964-980
[7]  
Branca RMM, 2014, NAT METHODS, V11, P59, DOI [10.1038/nmeth.2732, 10.1038/NMETH.2732]
[8]   Guanidino labeling derivatization strategy for global characterization of peptide mixtures by liquid chromatography matrix-assisted laser desorption/ionization mass spectrometry [J].
Brancia, FL ;
Montgomery, H ;
Tanaka, K ;
Kumashiro, S .
ANALYTICAL CHEMISTRY, 2004, 76 (10) :2748-2755
[9]   Cleavage N-terminal to proline: Analysis of a database of peptide tandem mass spectra [J].
Breci, LA ;
Tabb, DL ;
Yates, JR ;
Wysocki, VH .
ANALYTICAL CHEMISTRY, 2003, 75 (09) :1963-1971
[10]   Deep proteogenomics; high throughput gene validation by multidimensional liquid chromatography and mass spectrometry of proteins from the fungal wheat pathogen Stagonospora nodorum [J].
Bringans, Scott ;
Hane, James K. ;
Casey, Tammy ;
Tan, Kar-Chun ;
Lipscombe, Richard ;
Solomon, Peter S. ;
Oliver, Richard P. .
BMC BIOINFORMATICS, 2009, 10 :301