Machine learning approaches for predicting biomolecule-disease associations

被引:16
作者
Ding, Yulian [1 ]
Lei, Xiujuan [2 ]
Liao, Bo [3 ]
Wu, Fang-Xiang [4 ,5 ]
机构
[1] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK, Canada
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China
[3] Hainan Normal Univ, Sch Math & Stat, Haikou, Hainan, Peoples R China
[4] Univ Saskatchewan, Coll Engn, Saskatoon, SK, Canada
[5] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
biomolecule-disease association; multi-view data source; feature representation; machine learning; deep learning; non-negative matrix factorization; UPDATED DATABASE; NONCODING RNAS; NETWORK; MICRORNAS; INFERENCE; GENE; REPRESENTATION; EXPRESSION; SIMILARITY; INSIGHTS;
D O I
10.1093/bfgp/elab002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Biomolecules, such as microRNAs, circRNAs, lncRNAs and genes, are functionally interdependent in human cells, and all play critical roles in diverse fundamental and vital biological processes. The dysregulations of such biomolecules can cause diseases. Identifying the associations between biomolecules and diseases can uncover the mechanisms of complex diseases, which is conducive to their diagnosis, treatment, prognosis and prevention. Due to the time consumption and cost of biologically experimental methods, many computational association prediction methods have been proposed in the past few years. In this study, we provide a comprehensive review of machine learning-based approaches for predicting disease-biomolecule associations with multi-view data sources. Firstly, we introduce some databases and general strategies for integrating multi-view data sources in the prediction models. Then we discuss several feature representation methods for machine learning-based prediction models. Thirdly, we comprehensively review machine learning-based prediction approaches in three categories: basic machine learning methods, matrix completion-based methods and deep learning-based methods, while discussing their advantages and disadvantages. Finally, we provide some perspectives for further improving biomolecule-disease prediction methods.
引用
收藏
页码:273 / 287
页数:15
相关论文
共 138 条
[1]   OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders [J].
Amberger, Joanna S. ;
Bocchini, Carol A. ;
Schiettecatte, Francois ;
Scott, Alan F. ;
Hamosh, Ada .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D789-D798
[2]   microRNAs: Tiny regulators with great potential [J].
Ambros, V .
CELL, 2001, 107 (07) :823-826
[3]   MicroRNA pathways in flies and worms: Growth, death, fat, stress, and timing [J].
Ambros, V .
CELL, 2003, 113 (06) :673-676
[4]   The functions of animal microRNAs [J].
Ambros, V .
NATURE, 2004, 431 (7006) :350-355
[5]   Integrating node embeddings and biological annotations for genes to predict disease-gene associations [J].
Ata, Sezin Kircali ;
Le Ou-Yang ;
Fang, Yuan ;
Kwoh, Chee-Keong ;
Wu, Min ;
Li, Xiao-Li .
BMC SYSTEMS BIOLOGY, 2018, 12
[6]   LncRNADisease 2.0: an updated database of long non-coding RNA-associated diseases [J].
Bao, Zhenyu ;
Yang, Zhen ;
Huang, Zhou ;
Zhou, Yiran ;
Cui, Qinghua ;
Dong, Dong .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D1034-D1037
[7]   MicroRNAs: Genomics, biogenesis, mechanism, and function (Reprinted from Cell, vol 116, pg 281-297, 2004) [J].
Bartel, David P. .
CELL, 2007, 131 (04) :11-29
[8]  
Becker KG, 2004, NAT GENET, V36, P431, DOI 10.1038/ng0504-431
[9]   SomamiR 2.0: a database of cancer somatic mutations altering microRNA-ceRNA interactions [J].
Bhattacharya, Anindya ;
Cui, Yan .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D1005-D1010
[10]   The art and design of genetic screens: RNA interference [J].
Boutros, Michael ;
Ahringer, Julie .
NATURE REVIEWS GENETICS, 2008, 9 (07) :554-566