Proteogenomics of Malignant Melanoma Cell Lines: The Effect of Stringency of Exome Data Filtering on Variant Peptide Identification in Shotgun Proteomics

被引:13
作者
Lobas, Anna A. [1 ,2 ]
Pyatnitskiy, Mikhail A. [3 ,4 ]
Chernobrovkin, Alexey L. [5 ]
Ilina, Irina Y. [3 ]
Karpov, Dmitry S. [3 ,6 ]
Solovyeva, Elizaveta M. [2 ]
Kuznetsova, Ksenia G. [3 ]
Ivanov, Mark V. [2 ]
Lyssuk, Elena Y. [7 ]
Kliuchnikova, Anna A. [3 ,8 ]
Voronko, Olga E. [3 ]
Larin, Sergey S. [7 ]
Zubarev, Roman A.
Gorshkov, Mikhail V. [2 ]
Moshkovskii, Sergei A. [3 ,8 ]
机构
[1] Moscow Inst Phys & Technol, Dolgoprudnyi 141700, Moscow Region, Russia
[2] Russian Acad Sci, VL Talrose Inst Energy Problems Chem Phys, Moscow 119334, Russia
[3] Inst Biomed Chem, Moscow 119121, Russia
[4] Higher Sch Econ, Moscow 101000, Russia
[5] Karolinska Inst, S-17177 Stockholm, Sweden
[6] Russian Acad Sci, Engelhardt Inst Mol Biol, Moscow 119991, Russia
[7] Russian Acad Sci, Inst Gene Biol, Moscow 119334, Russia
[8] Pirogov Russian Natl Res Med Univ, Moscow 117997, Russia
基金
俄罗斯科学基金会; 俄罗斯基础研究基金会;
关键词
proteogenomics; melanoma; cell line; cancer genome; next-generation sequencing; shotgun proteomics; data integration; missense mutation; MASS-SPECTROMETRY; CANCER; SEARCH; EXPRESSION; HETEROGENEITY; DATABASES; PROTEINS; RESOURCE; BIOLOGY; GENES;
D O I
10.1021/acs.jproteome.7b00841
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The identification of genetically encoded variants at the proteome level is an important problem in cancer proteogenomics. The generation of customized protein databases from DNA or RNA sequencing data is a crucial stage of the identification workflow. Genomic data filtering applied at this stage may significantly modify variant search results, yet its effect is generally left out of the scope of proteogenomic studies. In this work, we focused on this impact using data of exome sequencing and LC-MS/MS analyses of six replicates for eight melanoma cell lines processed by a proteogenomics workflow. The main objectives were identifying variant peptides and revealing the role of the genomic data filtering in the variant identification. A series of six confidence thresholds for single nucleotide polymorphisms and indels from the exome data were applied to generate customized sequence databases of different stringency. In the searches against unfiltered databases, between 100 and 160 variant peptides were identified for each of the cell lines using X!Tandem and MS-GF+search engines. The recovery rate for variant peptides was similar to 1%, which is approximately three times lower than that of the wild-type peptides. Using unfiltered genomic databases for variant searches resulted in higher sensitivity and selectivity of the proteogenomic workflow and positively affected the ability to distinguish the cell lines based on variant peptide signatures.
引用
收藏
页码:1801 / 1811
页数:11
相关论文
共 50 条
[1]   The Exomes of the NCI-60 Panel: A Genomic Resource for Cancer Biology and Systems Pharmacology [J].
Abaan, Ogan D. ;
Polley, Eric C. ;
Davis, Sean R. ;
Zhu, Yuelin J. ;
Bilke, Sven ;
Walker, Robert L. ;
Pineda, Marbin ;
Gindin, Yevgeniy ;
Jiang, Yuan ;
Reinhold, William C. ;
Holbeck, Susan L. ;
Simon, Richard M. ;
Doroshow, James H. ;
Pommier, Yves ;
Meltzer, Paul S. .
CANCER RESEARCH, 2013, 73 (14) :4372-4382
[2]   Genomic Classification of Cutaneous Melanoma [J].
Akbani, Rehan ;
Akdemir, Kadir C. ;
Aksoy, B. Arman ;
Albert, Monique ;
Ally, Adrian ;
Amin, Samirkumar B. ;
Arachchi, Harindra ;
Arora, Arshi ;
Auman, J. Todd ;
Ayala, Brenda ;
Baboud, Julien ;
Balasundaram, Miruna ;
Balu, Saianand ;
Barnabas, Nandita ;
Bartlett, John ;
Bartlett, Pam ;
Bastian, Boris C. ;
Baylin, Stephen B. ;
Behera, Madhusmita ;
Belyaev, Dmitry ;
Benz, Christopher ;
Bernard, Brady ;
Beroukhim, Rameen ;
Bir, Natalie ;
Black, Aaron D. ;
Bodenheimer, Tom ;
Boice, Lori ;
Boland, Genevieve M. ;
Bono, Riccardo ;
Bootwalla, Moiz S. ;
Bosenberg, Marcus ;
Bowen, Jay ;
Bowlby, Reanne ;
Bristow, Christopher A. ;
Brockway-Lunardi, Laura ;
Brooks, Denise ;
Brzezinski, Jakub ;
Bshara, Wiam ;
Buda, Elizabeth ;
Burns, William R. ;
Butterfield, Yaron S. N. ;
Button, Michael ;
Calderone, Tiffany ;
Cappellini, Giancarlo Antonini ;
Carter, Candace ;
Carter, Scott L. ;
Cherney, Lynn ;
Cherniack, Andrew D. ;
Chevalier, Aaron ;
Chin, Lynda .
CELL, 2015, 161 (07) :1681-1696
[3]   Detecting protein variants by mass spectrometry: a comprehensive study in cancer cell-lines [J].
Alfaro, Javier A. ;
Ignatchenko, Alexandr ;
Ignatchenko, Vladimir ;
Sinha, Ankit ;
Boutros, Paul C. ;
Kislinger, Thomas .
GENOME MEDICINE, 2017, 9
[4]  
Alfaro JA, 2014, NAT METHODS, V11, P1107, DOI [10.1038/NMETH.3138, 10.1038/nmeth.3138]
[5]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[6]   Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry [J].
Bassani-Sternberg, Michal ;
Braunlein, Eva ;
Klar, Richard ;
Engleitner, Thomas ;
Sinitcyn, Pavel ;
Audehm, Stefan ;
Straub, Melanie ;
Weber, Julia ;
Slotta-Huspenina, Julia ;
Specht, Katja ;
Martignoni, Marc E. ;
Werner, Angelika ;
Hein, Rudiger ;
Busch, Dirk H. ;
Peschel, Christian ;
Rad, Roland ;
Cox, Jurgen ;
Mann, Matthias ;
Krackhardt, Angela M. .
NATURE COMMUNICATIONS, 2016, 7
[7]   Spatial genomic heterogeneity within localized, multifocal prostate cancer [J].
Boutros, Paul C. ;
Fraser, Michael ;
Harding, Nicholas J. ;
de Borja, Richard ;
Trudel, Dominique ;
Lalonde, Emilie ;
Meng, Alice ;
Hennings-Yeomans, Pablo H. ;
McPherson, Andrew ;
Sabelnykova, Veronica Y. ;
Zia, Amin ;
Fox, Natalie S. ;
Livingstone, Julie ;
Shiah, Yu-Jia ;
Wang, Jianxin ;
Beck, Timothy A. ;
Have, Cherry L. ;
Chong, Taryne ;
Sam, Michelle ;
Johns, Jeremy ;
Timms, Lee ;
Buchner, Nicholas ;
Wong, Ada ;
Watson, John D. ;
Simmons, Trent T. ;
P'ng, Christine ;
Zafarana, Gaetano ;
Nguyen, Francis ;
Luo, Xuemei ;
Chu, Kenneth C. ;
Prokopec, Stephenie D. ;
Sykes, Jenna ;
Dal Pra, Alan ;
Berlin, Alejandro ;
Brown, Andrew ;
Chan-Seng-Yue, Michelle A. ;
Yousif, Fouad ;
Denroche, Robert E. ;
Chong, Lauren C. ;
Chen, Gregory M. ;
Jung, Esther ;
Fung, Clement ;
Starmans, Maud H. W. ;
Chen, Hanbo ;
Govind, Shaylan K. ;
Hawley, James ;
D'Costa, Alister ;
Pintilie, Melania ;
Waggott, Daryl ;
Hach, Faraz .
NATURE GENETICS, 2015, 47 (07) :736-+
[8]   Effective filtering strategies to improve data quality from population-based whole exome sequencing studies [J].
Carson, Andrew R. ;
Smith, Erin N. ;
Matsui, Hiroko ;
Braekkan, Sigrid K. ;
Jepsen, Kristen ;
Hansen, John-Bjarne ;
Frazer, Kelly A. .
BMC BIOINFORMATICS, 2014, 15
[9]   Genome-wide association study identifies 14 novel risk alleles associated with basal cell carcinoma [J].
Chahal, Harvind S. ;
Wu, Wenting ;
Ransohoff, Katherine J. ;
Yang, Lingyao ;
Hedlin, Haley ;
Desai, Manisha ;
Lin, Yuan ;
Dai, Hong-Ji ;
Qureshi, Abrar A. ;
Li, Wen-Qing ;
Kraft, Peter ;
Hinds, David A. ;
Tang, Jean Y. ;
Han, Jiali ;
Sarin, Kavita Y. .
NATURE COMMUNICATIONS, 2016, 7
[10]   A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides [J].
Chick, Joel M. ;
Kolippakkam, Deepak ;
Nusinow, David P. ;
Zhai, Bo ;
Rad, Ramin ;
Huttlin, Edward L. ;
Gygi, Steven P. .
NATURE BIOTECHNOLOGY, 2015, 33 (07) :743-749