Prediction of Disease Associated Long Non-Coding RNA Based on HeteSim

被引:0
|
作者
Ma Y. [1 ]
Guo X. [1 ]
Sun Y. [1 ]
Yuan Q. [1 ]
Ren Y. [1 ]
Duan R. [1 ]
Gao L. [1 ]
机构
[1] School of Computer Science and Technology, Xidian University, Xi'an
基金
中国国家自然科学基金;
关键词
Correlation calculation; Disease-gene prediction; Heterogeneous information networks; HeteSim; Meta-path;
D O I
10.7544/issn1000-1239.2019.20180834
中图分类号
学科分类号
摘要
A growing number of evidences indicate that long non-coding RNAs (lncRNAs) play important roles in many biological processes, and mutations or dysfunction in these long non-coding RNAs can cause serious diseases in human bodies, such as various cancers. Biological methods have been exploited to predict potential associations between diseases and long non-coding RNAs, which are of great significance for the exploration of pathogenesis, diagnosis, treatment, prognosis and prevention of complex diseases. Heterogeneous information network is constructed based on the known disease-gene associations. The association strength between lncRNAs and diseases can be measured by an association score in the heterogeneous network. A simple method called HeteSim is applied to calculate the association scores between lncRNAs and diseases. The method used in this paper is based on all paths existing between a given disease and a given lncRNA. The experiments show that our method can achieve superior performance than state-of-art methods. Our predictions for ovarian cancer and gastric cancer have been verified by biological experiments, indicating the effectiveness of this method. The case studies indicate that our method can give informative clues for further investigation. In conclusion, the only paths based on known disease-gene associations are exploited, and it is can be expected that other disease associated information can also be integrated into our method, and better performance can be available. © 2019, Science Press. All right reserved.
引用
收藏
页码:1889 / 1896
页数:7
相关论文
共 34 条
  • [1] Bertone P., Stolc V., Royce T.E., Et al., Global identification of human transcribed sequences with genome tiling arrays, Science, 306, 5705, pp. 2242-2246, (2004)
  • [2] Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project, Nature, 447, 7146, pp. 799-816, (2007)
  • [3] Kapranov P., Cheng J., Dike S., Et al., RNA maps reveal new RNA classes and a possible function for pervasive transcription, Science, 316, 5830, pp. 1484-1488, (2007)
  • [4] Carninci P., Kasukawa T., Katayama S., Et al., The transcriptional landscape of the mammalian genome, Science, 309, 5740, pp. 1559-1563, (2005)
  • [5] Taft R.J., Pang K.C., Mercer T.R., Et al., Non-coding RNAs: Regulators of disease, The Journal of Pathology, 220, 2, pp. 126-139, (2010)
  • [6] Derrien T., Johnson R., Bussotti G., Et al., The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression, Genome Research, 22, 9, pp. 1775-1789, (2012)
  • [7] Quek X.C., Thomson D.W., Maag J.L., Et al., lncRNAdb v2.0: Expanding the reference database for functional long noncoding RNAs, Nucleic Acids Research, 43, D1, pp. 168-173, (2015)
  • [8] Chakraborty S., Deb A., Maji R.K., Et al., LncRBase: An enriched resource for lncRNA information, PloS One, 9, 9, (2014)
  • [9] Jiang Q., Ma R., Wang J., Et al., LncRNA2-Function: A comprehensive resource for functional investigation of human lncRNAs based on RNA-seq data, BMC Genomics, 16, (2015)
  • [10] Jiang Q., Wang J., Wu X., Et al., LncRNA2Target: A database for differentially expressed genes after lncRNA knockdown or overexpression, Nucleic Acids Research, 43, D1, pp. D193-D196, (2014)