Data Management of Sensitive Human Proteomics Data: Current Practices, Recommendations, and Perspectives for the Future

被引:26
作者
Bandeira, Nuno [1 ,2 ,3 ]
Deutsch, Eric W. [4 ]
Kohlbacher, Oliver [5 ,6 ,7 ,8 ]
Martens, Lennart [9 ,10 ]
Vizcaino, Juan Antonio [11 ]
机构
[1] Univ Calif San Diego UCSD, Ctr Computat Mass Spectrometry, La Jolla, CA USA
[2] Univ Calif San Diego UCSD, Dept Comp Sci & Engn, La Jolla, CA USA
[3] Univ Calif San Diego UCSD, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA USA
[4] Inst Syst Biol, Seattle, WA USA
[5] Univ Tubingen, Inst Bioinformat & Med Informat, Tubingen, Germany
[6] Univ Tubingen, Quantitat Biol Ctr, Tubingen, Germany
[7] Max Planck Inst Dev Biol, Biomol Interact, Tubingen, Germany
[8] Univ Hosp Tubingen, Inst Translat Bioinformat, Tubingen, Germany
[9] VIB, VIB UGent Ctr Med Biotechnol, Ghent, Belgium
[10] Univ Ghent, Dept Biomol Med, Ghent, Belgium
[11] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Trust Genome Campus, Cambridge, England
基金
欧盟地平线“2020”; 英国生物技术与生命科学研究理事会; 英国惠康基金;
关键词
IDENTIFICATION; PEPTIDES; GENOMICS; ARCHIVE; ATLAS;
D O I
10.1016/j.mcpro.2021.100071
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Today it is the norm that all relevant proteomics data that support the conclusions in scientific publications are made available in public proteomics data repositories. However, given the increase in the number of clinical proteomics studies, an important emerging topic is the management and dissemination of clinical, and thus potentially sensitive, human proteomics data. Both in the United States and in the European Union, there are legal frameworks protecting the privacy of individuals. Implementing privacy standards for publicly released research data in genomics and transcriptomics has led to processes to control who may access the data, so-called "controlled access" data. In parallel with the technological developments in the field, it is clear that the privacy risks of sharing proteomics data need to be properly assessed and managed. In our view, the proteomics community must be proactive in addressing these issues. Yet a careful balance must be kept. On the one hand, neglecting to address the potential of identifiability in human proteomics data could lead to reputational damage of the field, while on the other hand, erecting barriers to open access to clinical proteomics data will inevitably reduce reuse of proteomics data and could substantially delay critical discoveries in biomedical research. In order to balance these apparently conflicting requirements for data privacy and efficient use and reuse of research efforts through the sharing of clinical proteomics data, development efforts will be needed at different levels including bioinformatics infrastructure, policymaking, and mechanisms of oversight.
引用
收藏
页数:9
相关论文
共 41 条
[1]   New Guidelines for Publication of Manuscripts Describing Development and Application of Targeted Mass Spectrometry Measurements of Peptides and Proteins [J].
Abbatiello, Susan ;
Ackermann, Bradley L. ;
Borchers, Christoph ;
Bradshaw, Ralph A. ;
Carr, Steven A. ;
Chalkley, Robert ;
Choi, Meena ;
Deutsch, Eric ;
Domon, Bruno ;
Hoofnagle, Andrew N. ;
Keshishian, Hasmik ;
Kuhn, Eric ;
Liebler, Daniel C. ;
MacCoss, Michael ;
MacLean, Brendan ;
Mani, D. R. ;
Neubert, Hendrik ;
Smith, Derek ;
Vitek, Olga ;
Zimmerman, Lisa .
MOLECULAR & CELLULAR PROTEOMICS, 2017, 16 (03) :327-328
[2]  
[Anonymous], 2013, Federal register, V78, P5566
[3]  
[Anonymous], 2017, RED REGIST
[4]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
[5]   ArrayExpress update - from bulk to single-cell expression data [J].
Athar, Awais ;
Fullgrabe, Anja ;
George, Nancy ;
Iqbal, Haider ;
Huerta, Laura ;
Ali, Ahmed ;
Snow, Catherine ;
Fonseca, Nuno A. ;
Petryszak, Robert ;
Papatheodorou, Irene ;
Sarkans, Ugis ;
Brazma, Alvis .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D711-D715
[6]   NCBI GEO: archive for functional genomics data sets-update [J].
Barrett, Tanya ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Evangelista, Carlos ;
Kim, Irene F. ;
Tomashevsky, Maxim ;
Marshall, Kimberly A. ;
Phillippy, Katherine H. ;
Sherman, Patti M. ;
Holko, Michelle ;
Yefanov, Andrey ;
Lee, Hyeseung ;
Zhang, Naigong ;
Robertson, Cynthia L. ;
Serova, Nadezhda ;
Davis, Sean ;
Soboleva, Alexandra .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D991-D995
[7]   Proteomics Standards Initiative Extended FASTA Format [J].
Binz, Pierre-Alain ;
Shofstahl, Jim ;
Vizcaino, Juan Antonio ;
Barsnes, Harald ;
Chalkley, Robert J. ;
Menschaert, Gerben ;
Alpi, Emanuele ;
Clauser, Karl ;
Eng, Jimmy K. ;
Lane, Lydie ;
Seymour, Sean L. ;
Sanchez, Luis Francisco Hernandez ;
Mayer, Gerhard ;
Eisenacher, Martin ;
Perez-Riverol, Yasset ;
Kapp, Eugene A. ;
Mendoza, Luis ;
Baker, Peter R. ;
Collins, Andrew ;
Van den Bossche, Tim ;
Deutsch, Eric W. .
JOURNAL OF PROTEOME RESEARCH, 2019, 18 (06) :2686-2692
[8]   The Age of Data-Driven Proteomics: How Machine Learning Enables Novel Workflows [J].
Bouwmeester, Robbin ;
Gabriels, Ralf ;
Van Den Bossche, Tim ;
Martens, Lennart ;
Degroeve, Sven .
PROTEOMICS, 2020, 20 (21-22)
[9]   Reporting protein identification data - The next generation of guidelines [J].
Bradshaw, RA ;
Burlingame, AL ;
Carr, S ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2006, 5 (05) :787-788
[10]   Hair Proteome Variation at Different Body Locations on Genetically Variant Peptide Detection for Protein-Based Human Identification [J].
Chu, Fanny ;
Mason, Katelyn E. ;
Anex, Deon S. ;
Jones, A. Daniel ;
Hart, Bradley R. .
SCIENTIFIC REPORTS, 2019, 9 (1)