NGS-Logistics: federated analysis of NGS sequence variants across multiple locations

被引:16
作者
Ardeshirdavani, Amin [1 ,2 ]
Souche, Erika [3 ]
Dehaspe, Luc [3 ]
Van Houdt, Jeroen [3 ]
Vermeesch, Joris Robert [3 ]
Moreau, Yves [1 ,2 ]
机构
[1] Katholieke Univ Leuven, STADIUS Ctr Dynam Syst Signal Proc & Data Analyt, Dept Elect Engn ESAT, B-3001 Leuven, Belgium
[2] iMinds Med IT Dept, B-3001 Leuven, Belgium
[3] Katholieke Univ Leuven, Ctr Human Genet Gasthuisberg, B-3000 Leuven, Belgium
关键词
DATABASE; FORMAT;
D O I
10.1186/s13073-014-0071-9
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As many personal genomes are being sequenced, collaborative analysis of those genomes has become essential. However, analysis of personal genomic data raises important privacy and confidentiality issues. We propose a methodology for federated analysis of sequence variants from personal genomes. Specific base-pair positions and/or regions are queried for samples to which the user has access but also for the whole population. The statistics results do not breach data confidentiality but allow further exploration of the data; researchers can negotiate access to relevant samples through pseudonymous identifiers. This approach minimizes the impact on data confidentiality while enabling powerful data analysis by gaining access to important rare samples.
引用
收藏
页数:11
相关论文
共 14 条
[1]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[2]   Genomes by the thousand [J].
不详 .
NATURE, 2010, 467 (7319) :1026-1027
[3]   The Genome of the Netherlands: design, and project goals [J].
Boomsma, Dorret I. ;
Wijmenga, Cisca ;
Slagboom, Eline P. ;
Swertz, Morris A. ;
Karssen, Lennart C. ;
Abdellaoui, Abdel ;
Ye, Kai ;
Guryev, Victor ;
Vermaat, Martijn ;
van Dijk, Freerk ;
Francioli, Laurent C. ;
Hottenga, Jouke Jan ;
Laros, Jeroen F. J. ;
Li, Qibin ;
Li, Yingrui ;
Cao, Hongzhi ;
Chen, Ruoyan ;
Du, Yuanping ;
Li, Ning ;
Cao, Sujie ;
van Setten, Jessica ;
Menelaou, Androniki ;
Pulit, Sara L. ;
Hehir-Kwa, Jayne Y. ;
Beekman, Marian ;
Elbers, Clara C. ;
Byelas, Heorhiy ;
de Craen, Anton J. M. ;
Deelen, Patrick ;
Dijkstra, Martijn ;
den Dunnen, Johan T. ;
de Knijff, Peter ;
Houwing-Duistermaat, Jeanine ;
Koval, Vyacheslav ;
Estrada, Karol ;
Hofman, Albert ;
Kanterakis, Alexandros ;
van Enckevort, David ;
Mai, Hailiang ;
Kattenberg, Mathijs ;
van Leeuwen, Elisabeth M. ;
Neerincx, Pieter B. T. ;
Oostra, Ben ;
Rivadeneira, Fernanodo ;
Suchiman, Eka H. D. ;
Uitterlinden, Andre G. ;
Willemsen, Gonneke ;
Wolffenbuttel, Bruce H. ;
Wang, Jun ;
de Bakker, Paul I. W. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2014, 22 (02) :221-227
[4]   The variant call format and VCFtools [J].
Danecek, Petr ;
Auton, Adam ;
Abecasis, Goncalo ;
Albers, Cornelis A. ;
Banks, Eric ;
DePristo, Mark A. ;
Handsaker, Robert E. ;
Lunter, Gerton ;
Marth, Gabor T. ;
Sherry, Stephen T. ;
McVean, Gilean ;
Durbin, Richard .
BIOINFORMATICS, 2011, 27 (15) :2156-2158
[5]   The UCSC Genome Browser database: update 2011 [J].
Fujita, Pauline A. ;
Rhead, Brooke ;
Zweig, Ann S. ;
Hinrichs, Angie S. ;
Karolchik, Donna ;
Cline, Melissa S. ;
Goldman, Mary ;
Barber, Galt P. ;
Clawson, Hiram ;
Coelho, Antonio ;
Diekhans, Mark ;
Dreszer, Timothy R. ;
Giardine, Belinda M. ;
Harte, Rachel A. ;
Hillman-Jackson, Jennifer ;
Hsu, Fan ;
Kirkup, Vanessa ;
Kuhn, Robert M. ;
Learned, Katrina ;
Li, Chin H. ;
Meyer, Laurence R. ;
Pohl, Andy ;
Raney, Brian J. ;
Rosenbloom, Kate R. ;
Smith, Kayla E. ;
Haussler, David ;
Kent, W. James .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D876-D882
[6]   On the Future of Genomic Data [J].
Kahn, Scott D. .
SCIENCE, 2011, 331 (6018) :728-729
[7]  
Li H, 2009, BIOINFORMATICS, V25, P1094, DOI [10.1093/bioinformatics/btp324, 10.1093/bioinformatics/btp100]
[8]   The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data [J].
McKenna, Aaron ;
Hanna, Matthew ;
Banks, Eric ;
Sivachenko, Andrey ;
Cibulskis, Kristian ;
Kernytsky, Andrew ;
Garimella, Kiran ;
Altshuler, David ;
Gabriel, Stacey ;
Daly, Mark ;
DePristo, Mark A. .
GENOME RESEARCH, 2010, 20 (09) :1297-1303
[9]  
Prime Minister's Office, DNA TESTS REV FIGHT
[10]   A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms [J].
Sachidanandam, R ;
Weissman, D ;
Schmidt, SC ;
Kakol, JM ;
Stein, LD ;
Marth, G ;
Sherry, S ;
Mullikin, JC ;
Mortimore, BJ ;
Willey, DL ;
Hunt, SE ;
Cole, CG ;
Coggill, PC ;
Rice, CM ;
Ning, ZM ;
Rogers, J ;
Bentley, DR ;
Kwok, PY ;
Mardis, ER ;
Yeh, RT ;
Schultz, B ;
Cook, L ;
Davenport, R ;
Dante, M ;
Fulton, L ;
Hillier, L ;
Waterston, RH ;
McPherson, JD ;
Gilman, B ;
Schaffner, S ;
Van Etten, WJ ;
Reich, D ;
Higgins, J ;
Daly, MJ ;
Blumenstiel, B ;
Baldwin, J ;
Stange-Thomann, NS ;
Zody, MC ;
Linton, L ;
Lander, ES ;
Altshuler, D .
NATURE, 2001, 409 (6822) :928-933