Prediction of microRNA-regulated protein interaction pathways in Arabidopsis using machine learning algorithms

被引:11
作者
Kurubanjerdjit, Nilubon [1 ,2 ]
Huang, Chien-Hung [3 ]
Lee, Yu-Liang [1 ]
Tsai, Jeffrey J. P. [1 ]
Ng, Ka-Lok [1 ,4 ]
机构
[1] Asia Univ, Dept Biomed Informat, Taichung 41354, Taiwan
[2] Mea Fah Luang Univ, Sch Informat Technol, Chiang Rai 57100, Thailand
[3] Natl Formosa Univ, Dept Comp Sci & Informat Engn, Yunlin 63205, Taiwan
[4] China Med Univ, Sch Pharm, Taichung 40402, Taiwan
关键词
Arabidopsis thaliana; microRNA target prediction; Machine learning algorithm; Protein-protein interaction; Enrichment analysis; Jacard coefficient; WEB-TOOL; IDENTIFICATION; DATABASE; TARGETS; GENES;
D O I
10.1016/j.compbiomed.2013.08.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
MicroRNAs are small, endogenous RNAs found in many different species and are known to have an influence on diverse biological phenomena. They also play crucial roles in plant biological processes, such as metabolism, leaf sidedness and flower development. However, the functional roles of most microRNAs are still unknown. The identification of closely related microRNAs and target genes can be an essential first step towards the discovery of their combinatorial effects on different cellular states. A lot of research has tried to discover microRNAs and target gene interactions by implementing machine learning classifiers with target prediction algorithms. However, high rates of false positives have been reported as a result of undetermined factors which will affect recognition. Therefore, integrating diverse techniques could improve the prediction. In this paper we propose identifying microRNAs target of Arabidopsis thaliana by integrating prediction scores from PITA, miRanda and RNAHybrid algorithms used as a feature vector of microRNA-target interactions, and then implementing SVM, random forest tree and neural network machine learning algorithms to make final predictions by majority voting. Furthermore, microRNA target genes are linked with their protein-protein interaction (PPI) partners. We focus on plant resistance genes and transcription factor information to provide new insights into plant pathogen interaction networks. Downstream pathways are characterized by the Jaccard coefficient, which is implemented based on Gene Ontology. The database is freely accessible at http://ppi.bioinfo.asia.edu.tw/At_miRNA/. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1645 / 1652
页数:8
相关论文
共 31 条
[1]  
Alpaydin E., 2011, Introduction to Machine Learning, V2nd
[2]   Comprehensive prediction of novel microRNA targets in Arabidopsis thaliana [J].
Alves-Junior, Leonardo ;
Niemeier, Sandra ;
Hauenschild, Arne ;
Rehmsmeier, Marc ;
Merkle, Thomas .
NUCLEIC ACIDS RESEARCH, 2009, 37 (12) :4010-4021
[3]  
[Anonymous], 2003, PRACTICAL GUIDE SUPP
[4]   MicroRNAs: Genomics, biogenesis, mechanism, and function (Reprinted from Cell, vol 116, pg 281-297, 2004) [J].
Bartel, David P. .
CELL, 2007, 131 (04) :11-29
[5]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[6]   'MicroRNA Targets', a new AthaMap web-tool for genome-wide identification of miRNA targets in Arabidopsis thaliana [J].
Buelow, Lorenz ;
Bolivar, Julio C. ;
Ruhe, Jonas ;
Brill, Yuri ;
Hehl, Reinhard .
BIODATA MINING, 2012, 5
[7]   Identification of new microRNA-regulated genes by conserved targeting in plant species [J].
Chorostecki, Uciel ;
Crosa, Valeria A. ;
Lodeyro, Anabella F. ;
Bologna, Nicolas G. ;
Martin, Ana P. ;
Carrillo, Nestor ;
Schommer, Carla ;
Palatnik, Javier F. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (18) :8893-8904
[8]   psRNATarget: a plant small RNA target analysis server [J].
Dai, Xinbin ;
Zhao, Patrick Xuechun .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W155-W159
[9]   Computational analysis of miRNA targets in plants: current status and challenges [J].
Dai, Xinbin ;
Zhuang, Zhaohong ;
Zhao, Patrick Xuechun .
BRIEFINGS IN BIOINFORMATICS, 2011, 12 (02) :115-121
[10]  
Enright AJ, 2004, GENOME BIOL, V5