Inferring Association between Compound and Pathway with an Improved Ensemble Learning Method

被引:6
作者
Song, Meiyue [2 ]
Jiang, Zhenran [1 ,2 ]
机构
[1] E China Normal Univ, Dept Comp Sci & Technol, Shanghai 200241, Peoples R China
[2] E China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
关键词
Ensemble Learning; RGRF method; Rotation forest; Compound-pathway interaction; TARGET INTERACTION NETWORKS; CONNECTIVITY MAP; DRUG; PREDICTION; GENES; INTEGRATION; DISCOVERY; DATABASE; SYSTEM;
D O I
10.1002/minf.201500033
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Emergence of compound molecular data coupled to pathway information offers the possibility of using machine learning methods for compound-pathway associations' inference. To provide insights into the global relationship between compounds and their affected pathways, a improved Rotation Forest ensemble learning method called RGRF (Relief & GBSSL - Rotation Forest) was proposed to predict their potential associations. The main characteristic of the RGRF lies in using the Relief algorithm for feature extraction and regarding the Graph-Based Semi-Supervised Learning method as classifier. By incorporating the chemical structure information, drug mode of action information and genomic space information, our method can achieve a better precision and flexibility on compound-pathway prediction. Moreover, several new compound-pathway associations that having the potential for further clinical investigation have been identified by database searching. In the end, a prediction tool was developed using RGRF algorithm, which can predict the interactions between pathways and all of the compounds in cMap database.
引用
收藏
页码:753 / 760
页数:8
相关论文
共 34 条
  • [1] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [2] CancerResource: a comprehensive database of cancer-relevant proteins and compound interactions supported by experimental knowledge
    Ahmed, Jessica
    Meinel, Thomas
    Dunkel, Mathias
    Murgueitio, Manuela S.
    Adams, Robert
    Blasse, Corinna
    Eckert, Andreas
    Preissner, Saskia
    Preissner, Robert
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D960 - D967
  • [3] [Anonymous], 2013, RDKIT CHEMINFORMATIC
  • [4] System-based drug discovery within the human kinome
    Bamborough, Paul
    [J]. EXPERT OPINION ON DRUG DISCOVERY, 2012, 7 (11) : 1053 - 1070
  • [5] Supervised prediction of drug-target interactions using bipartite local models
    Bleakley, Kevin
    Yamanishi, Yoshihiro
    [J]. BIOINFORMATICS, 2009, 25 (18) : 2397 - 2403
  • [6] Bolstad B. M., 2007, R PACKAGE
  • [7] Cheriyadat A, 2003, INT GEOSCI REMOTE SE, P3420
  • [8] The Comparative Toxicogenomics Database: update 2011
    Davis, Allan Peter
    King, Benjamin L.
    Mockus, Susan
    Murphy, Cynthia G.
    Saraceni-Richards, Cynthia
    Rosenstein, Michael
    Wiegers, Thomas
    Mattingly, Carolyn J.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D1067 - D1072
  • [9] Fern Xiaoli Zhang, 2003, ICML, P186
  • [10] Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways
    Hattori, M
    Okuno, Y
    Goto, S
    Kanehisa, M
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2003, 125 (39) : 11853 - 11865