Identity inference of genomic data using long-range familial searches

被引:209
作者
Erlich, Yaniv [1 ,2 ,3 ,4 ]
Shor, Tal [1 ]
Pe'er, Itsik [2 ,3 ]
Carmi, Shai [5 ]
机构
[1] MyHeritage, IL-6037606 Or Yehuda, Israel
[2] Columbia Univ, Dept Comp Sci, Fu Fdn Sch Engn, New York, NY 10027 USA
[3] Columbia Univ, Dept Syst Biol, Ctr Computat Biol & Bioinformat C2B2, New York, NY 10027 USA
[4] New York Genome Ctr, New York, NY 10013 USA
[5] Hebrew Univ Jerusalem, Braun Sch Publ Hlth & Community Med, Jerusalem, Israel
基金
以色列科学基金会;
关键词
D O I
10.1126/science.aau4832
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Consumer genomics databases have reached the scale of millions of individuals. Recently, law enforcement authorities have exploited some of these databases to identify suspects via distant familial relatives. Using genomic data of 1.28 million individuals tested with consumer genomics, we investigated the power of this technique. We project that about 60% of the searches for individuals of European descent will result in a third-cousin or closer match, which theoretically allows their identification using demographic identifiers. Moreover, the technique could implicate nearly any U.S. individual of European descent in the near future. We demonstrate that the technique can also identify research participants of a public sequencing project. On the basis of these results, we propose a potential mitigation strategy and policy implications for human subject research.
引用
收藏
页码:690 / +
页数:36
相关论文
共 42 条
[1]  
Aldhous P., 2018, BUZZFEED
[2]  
[Anonymous], 2018, N.Y. TIMES
[3]  
[Anonymous], 2017, FED REGISTER, V82, P7149
[4]  
[Anonymous], 2018, WASHINGTON POST
[5]  
Augenstein S., 2018, FORENSIC 0509
[6]  
Augenstein S., 2018, FORENSIC 0416
[7]   High-speed high-security signatures [J].
Bernstein, Daniel J. ;
Duif, Niels ;
Lange, Tanja ;
Schwabe, Peter ;
Yang, Bo-Yin .
JOURNAL OF CRYPTOGRAPHIC ENGINEERING, 2012, 2 (02) :77-89
[8]  
Bettinger B. T., 2017, SHARED CM PROJECT VE
[9]  
DeMille D., 2018, SPECTRUM
[10]  
Edge D., 2018, GCBIAS 0507