A novel, privacy-preserving cryptographic approach for sharing sequencing data

被引:8
作者
Cassa, Christopher A. [1 ,2 ,3 ]
Miller, Rachel A. [3 ]
Mandl, Kenneth D. [4 ,5 ,6 ]
机构
[1] Brigham & Womens Hosp, Div Genet, Boston, MA 02215 USA
[2] Harvard Univ, Sch Med, Div Genet, Boston, MA USA
[3] MIT, CSAIL, Cambridge, MA 02139 USA
[4] Childrens Hosp, Harvard MIT Hlth Sci & Technol, Childrens Hosp Informat Program, Boston, MA 02115 USA
[5] Harvard Univ, Sch Med, Div Pediat, Boston, MA USA
[6] Childrens Hosp, Manton Ctr Orphan Dis, Boston, MA 02115 USA
关键词
MANAGED CARE ORGANIZATION; PERSONALIZED MEDICINE; GENETIC INFORMATION; GENOMIC RESEARCH; HEART-DISEASE; FRAMINGHAM; RISK; ASSOCIATION; WOMEN; DNA;
D O I
10.1136/amiajnl-2012-001366
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual's genetic sequence. This scheme requires access to a subset of an individual's genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual's genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual's genetic sequence.
引用
收藏
页码:69 / 76
页数:8
相关论文
共 41 条
[1]   GenePING: Secure, scalable management of personal genomic data [J].
Adida, Ben ;
Kohane, Isaac S. .
BMC GENOMICS, 2006, 7 (1)
[2]   Genome-wide association with select biomarker traits in the Framingham Heart Study [J].
Benjamin, Emelia J. ;
Dupuis, Josee ;
Larson, Martin G. ;
Lunetta, Kathryn L. ;
Booth, Sarah L. ;
Govindaraju, Diddahally R. ;
Kathiresan, Sekar ;
Keaney, John F., Jr. ;
Keyes, Michelle J. ;
Lin, Jing-Ping ;
Meigs, James B. ;
Robins, Sander J. ;
Rong, Jian ;
Schnabel, Renate ;
Vita, Joseph A. ;
Wang, Thomas J. ;
Wilson, Peter W. F. ;
Wolf, Philip A. ;
Vasan, Ramachandran S. .
BMC MEDICAL GENETICS, 2007, 8
[3]  
Bieber F, 2004, NEW SCI, V184, P20
[4]   Human genetics - Finding criminals through DNA of their relatives [J].
Bieber, FR ;
Brenner, CH ;
Lazer, D .
SCIENCE, 2006, 312 (5778) :1315-1316
[5]   Biobanking: freezer burn [J].
Blow, Nathan .
NATURE METHODS, 2009, 6 (02) :173-177
[6]   Application of Framingham risk estimates to ethnic minorities in United Kingdom and implications for primary prevention of heart disease in general practice: cross sectional population based study [J].
Cappuccio, FP ;
Oakeshott, P ;
Strazzullo, P ;
Kerry, SM .
BRITISH MEDICAL JOURNAL, 2002, 325 (7375) :1271-1274B
[7]   My sister's keeper?: genomic research and the identifiability of siblings [J].
Cassa, Christopher A. ;
Schmidt, Brian ;
Kohane, Isaac S. ;
Mandl, Kenneth D. .
BMC MEDICAL GENOMICS, 2008, 1 (1)
[8]   Weight, weight gain, activity, and major illnesses: The Nurses' Health Study [J].
Colditz, GA ;
Coakley, E .
INTERNATIONAL JOURNAL OF SPORTS MEDICINE, 1997, 18 :S162-S170
[9]  
EBI, 2012, EUR GEN PHEN ARCH
[10]  
EmbeddedSw.net, 2012, CRYPT 256 BIT CIPH 2