The integration of weighted human gene association networks based on link prediction

Cited by: 4
Authors
Yang, Jian [1]
Yang, Tinghong [1]
Wu, Duzhi [1]
Lin, Limei [1]
Yang, Fan [1]
Zhao, Jing [1,2]
Affiliations
[1] Logist Engn Univ, Dept Math, Chongqing, Peoples R China
[2] Shanghai Univ Tradit Chinese Med, Inst Interdisciplinary Complex Res, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Gene association network; Weighted network; Link prediction; Network integration; PROTEIN-INTERACTION NETWORKS; DATABASE; FRAMEWORK; RESOURCE; SETS;
DOI
10.1186/s12918-017-0398-0
Chinese Library Classification number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Background: Physical and functional interactions between genes or proteins are biologically important for cellular function. Several efforts have constructed weighted gene association meta-networks by integrating multiple biological resources, where each weight indicates the confidence of the interaction. However, these existing human gene association networks share only a limited number of overlapping interactions, which suggests that they are incomplete and noisy.
Results: We propose a workflow that constructs a weighted human gene association network from six existing networks: two weighted specific PPI networks and four gene association meta-networks. We applied a link prediction algorithm to predict possible missing links in each network, used a cross-validation approach to refine each network, and then integrated the refined networks into a final network.
Conclusions: The information shared among the refined networks increases notably, suggesting higher reliability. The final integrated network contains many more links than most of the original networks, while its links retain high functional relevance. Used as the background network in a case study of disease gene prediction, the final integrated network performs well, which supports its reliability and practical value. The workflow may be useful for integrating and refining existing gene association data.
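The abstract does not say which link prediction algorithm the workflow uses. As a rough illustration of what predicting missing links in a weighted network can look like, the sketch below scores each unlinked pair with a weighted common-neighbours index, one of the simplest similarity measures. The function name, the toy gene labels, and the weights are all hypothetical and are not taken from the paper.

```python
from itertools import combinations


def weighted_common_neighbors(edges):
    """Score every unlinked node pair by summing, over each shared
    neighbour z, the weights of the two edges (x, z) and (z, y).

    `edges` is an iterable of (node_u, node_v, weight) tuples describing
    an undirected weighted network.
    """
    # Build a symmetric adjacency map: node -> {neighbour: weight}
    adj = {}
    for u, v, w in edges:
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w

    scores = {}
    for x, y in combinations(sorted(adj), 2):
        if y in adj[x]:
            continue  # the pair is already linked, nothing to predict
        shared = adj[x].keys() & adj[y].keys()
        if shared:
            scores[(x, y)] = sum(adj[x][z] + adj[y][z] for z in shared)
    return scores


# Toy network with made-up node labels and confidence weights (assumption)
toy_edges = [("A", "B", 0.9), ("B", "C", 0.8), ("A", "D", 0.4), ("D", "C", 0.3)]
predicted = weighted_common_neighbors(toy_edges)

# Rank candidate links from most to least supported
ranked = sorted(predicted.items(), key=lambda item: item[1], reverse=True)
```

In a workflow like the one described, high-scoring pairs would be candidate missing links; how such candidates are validated and merged across the six networks is specific to the paper and is not reproduced here.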
Pages: 17