DNA Motif Finding Method Without Protection Can Leak User Privacy

被引:11
作者
Wu, Xiang [1 ,2 ]
Wang, Huanhuan [1 ,2 ]
Shi, Minyu [1 ,2 ]
Wang, Aming [1 ,2 ]
Xia, Kaijian [3 ]
机构
[1] Xuzhou Med Univ, Inst Med Informat, Xuzhou 221000, Jiangsu, Peoples R China
[2] Xuzhou Med Univ, Inst Med Informat & Hlth Big Data, Xuzhou 221000, Jiangsu, Peoples R China
[3] Soochow Univ, Changshu 1 Peoples Hosp, Changshu Affiliated Hosp, Changshu 215500, Jiangsu, Peoples R China
关键词
DNA; Data privacy; Bioinformatics; Genomics; Privacy; Databases; DNA motif finding; privacy disclosure; DNA sequences; privacy protection; FACTOR-BINDING SITES; POSITION FREQUENCY MATRICES; REGULATORY ELEMENTS; GENE POLYMORPHISM; DISCOVERY; IDENTIFICATION; SIMILARITY; ALGORITHM; SEQUENCES; PATTERNS;
D O I
10.1109/ACCESS.2019.2947261
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
DNA sequence analysis plays an important role in the study of gene regulatory networks. DNA motif finding has become a key discipline in the post-gene era and gradually become a research hotspot by mining key gene sequences corresponding to disease mechanism and important biological functions. However, the research of DNA motif finding is faced with a huge problem of privacy disclosure. DNA motif finding technology cannot manage and use data well under controllable conditions, and the mining process of DNA motif finding itself is prone to reveal private information such as individual traits, characteristics and disease defects. In this paper, we presented an overview of the privacy breaching of DNA motif finding, summarized the main methods and tools of the current DNA motif finding, analyzed its privacy risks, and used two case studies to verify that the DNA motif finding may identify individual privacy information. Finally, we discussed the privacy protection methods for motif finding and proposed the privacy protection solutions.
引用
收藏
页码:152076 / 152087
页数:12
相关论文
共 75 条
[1]   A pan-cancer proteomic perspective on The Cancer Genome Atlas [J].
Akbani, Rehan ;
Ng, Patrick Kwok Shing ;
Werner, Henrica M. J. ;
Shahmoradgoli, Maria ;
Zhang, Fan ;
Ju, Zhenlin ;
Liu, Wenbin ;
Yang, Ji-Yeon ;
Yoshihara, Kosuke ;
Li, Jun ;
Ling, Shiyun ;
Seviour, Elena G. ;
Ram, Prahlad T. ;
Minna, John D. ;
Diao, Lixia ;
Tong, Pan ;
Heymach, John V. ;
Hill, Steven M. ;
Dondelinger, Frank ;
Stadler, Nicolas ;
Byers, Lauren A. ;
Meric-Bernstam, Funda ;
Weinstein, John N. ;
Broom, Bradley M. ;
Verhaak, Roeland G. W. ;
Liang, Han ;
Mukherjee, Sach ;
Lu, Yiling ;
Mills, Gordon B. .
NATURE COMMUNICATIONS, 2014, 5
[2]   Discovering Gene Regulatory Elements Using Coverage-Based Heuristics [J].
Al-Ouran, Rami ;
Schmidt, Robert ;
Naik, Ashwini ;
Jones, Jeffrey ;
Drews, Frank ;
Juedes, David ;
Elnitski, Laura ;
Welch, Lonnie .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (04) :1290-1300
[3]   Genetic privacy needs a more nuanced approach [J].
Angrist, Misha .
NATURE, 2013, 494 (7435) :7-7
[4]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[5]   MEME: discovering and analyzing DNA and protein sequence motifs [J].
Bailey, Timothy L. ;
Williams, Nadya ;
Misleh, Chris ;
Li, Wilfred W. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W369-W373
[6]  
Boucher C, 2010, BMC BIOINFORMATICS, V11, P1
[7]  
Boucher C., 2007, P INT WORKSH ALG BIO, V4645, P60
[8]  
Bregman-Eschet Y., 2006, Santa Clara Computer and High-Technology Law Journal, V23, P1
[9]  
Carter A. B., 2013, J MOL DIAGN, V21, P542
[10]   Association of TBX20 Gene Polymorphism with Congenital Heart Disease in Han Chinese Neonates [J].
Chen, Junhua ;
Sun, Fuqiang ;
Fu, Jia ;
Zhang, Hongyan .
PEDIATRIC CARDIOLOGY, 2015, 36 (04) :737-742