Prioritizing Disease Genes by Using Search Engine Algorithm

被引:22
作者
Li, Min [1 ,2 ]
Zheng, Ruiqing [1 ]
Li, Qi [1 ]
Wang, Jianxin [1 ]
Wu, Fang-Xiang [1 ,3 ,4 ]
Zhang, Zhuohua [2 ]
机构
[1] Cent S Univ, Sch Informat Sci & Engn, Changsha 410083, Peoples R China
[2] State Key Lab Med Genet, Changsha, Hunan, Peoples R China
[3] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 0W0, Canada
[4] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK, Canada
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
systems biology; protein-protein interaction network; disease gene; search engine algorithm; random walk; disease similarity; PROTEIN-INTERACTION NETWORKS; CANDIDATE GENES; FUNCTIONAL MODULES; PPI NETWORK; IDENTIFICATION; INTERACTOME; INFORMATION; SIMILARITY; DOCKING;
D O I
10.2174/1574893611666160125220905
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
It is a fundamental challenge that identifying disease genes from a large number of candidates for a specific disease. As the biological experiment-based methods are generally time-consuming and laborious, it has become a new strategy to identify disease candidates by using computational approaches. In this paper, we proposed an algorithm based on the search engine ranking method, named PDGTR, to prioritize disease candidates. Firstly, we constructed a weighted human disease network by calculating the topological similarity and phenotype similarity of each pair of diseases. Then, we calculated the similarities of all the genes by using the protein-protein interaction network and the edge clustering coefficient. For a specific disease, a logistic regression model was used to generate the prior-knowledge of each gene. Finally, the search engine ranking based algorithm PDGTR was applied to prioritize the disease candidates. The proposed algorithm PDGTR was tested on five typical cancers: Breast Cancer, Colorectal Cancer, Hepatocellular carcinoma, Gastric Cancer and Osteoporosis, and compared with four state-of-the-art algorithms: RWR, DADA, PRINCE and PRP. The experimental results based on the leave-one-out cross validation, precision, ROC curve, and enrichment show that the proposed algorithm PDGTR outperforms RWR, DADA, PRINCE and PRP. Moreover, some potential disease genes were predicted by PDGTR and already mentioned by some literatures.
引用
收藏
页码:195 / 202
页数:8
相关论文
共 38 条
[2]   International epidemiology of prostate cancer: Geographical distribution and secular trends [J].
Baade, Peter D. ;
Youlden, Danny R. ;
Krnjacki, Lauren J. .
MOLECULAR NUTRITION & FOOD RESEARCH, 2009, 53 (02) :171-184
[3]   Disease candidate gene identification and prioritization using protein interaction networks [J].
Chen, Jing ;
Aronow, Bruce J. ;
Jegga, Anil G. .
BMC BIOINFORMATICS, 2009, 10
[4]   HNF1B deletions in patients with young-onset diabetes but no known renal disease [J].
Edghill, E. L. ;
Stals, K. ;
Oram, R. A. ;
Shepherd, M. H. ;
Hattersley, A. T. ;
Ellard, S. .
DIABETIC MEDICINE, 2013, 30 (01) :114-117
[5]   DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization [J].
Erten, Sinan ;
Bebek, Gurkan ;
Ewing, Rob M. ;
Koyutuerk, Mehmet .
BIODATA MINING, 2011, 4
[6]   Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes [J].
Franke, Lude ;
van Bakel, Harm ;
Fokkens, Like ;
de Jong, Edwin D. ;
Egmont-Petersen, Michael ;
Wijmenga, Cisca .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 78 (06) :1011-1025
[7]   The human disease network [J].
Goh, Kwang-Il ;
Cusick, Michael E. ;
Valle, David ;
Childs, Barton ;
Vidal, Marc ;
Barabasi, Albert-Laszlo .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (21) :8685-8690
[8]   Novel GATA5 loss-of-function mutations underlie familial atrial fibrillation [J].
Gu, Jian-Yun ;
Xu, Jia-Hong ;
Yu, Hong ;
Yang, Yi-Qing .
CLINICS, 2012, 67 (12) :1393-1399
[9]   Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders [J].
Hamosh, A ;
Scott, AF ;
Amberger, J ;
Bocchini, C ;
Valle, D ;
McKusick, VA .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :52-55
[10]   Prioritizing disease candidate genes by a gene interconnectedness-based approach [J].
Hsu, Chia-Lang ;
Huang, Yen-Hua ;
Hsu, Chien-Ting ;
Yang, Ueng-Cheng .
BMC GENOMICS, 2011, 12