Inference of Autism-Related Genes by Integrating Protein-Protein Interactions and miRNA-Target Interactions

被引:1
作者
Dang Hung Tran [1 ]
Thanh-Phuong Nguyen [2 ]
Caberlotto, Laura [2 ]
Priami, Corrado [2 ,3 ]
机构
[1] Hanoi Natl Univ Educ, Hanoi, Vietnam
[2] Univ Trent, Microsoft Res, I-38100 Trento, Italy
[3] Univ Trent, Dept Mat, I-38100 Trento, Italy
来源
KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 1 | 2014年 / 244卷
关键词
DISEASE-GENES; MICRORNA; DATABASE; NETWORK; BIOGENESIS;
D O I
10.1007/978-3-319-02741-8_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autism spectrum disorders (ASD) are a group of conditions characterized by impairments in social interaction and presence of repetitive behavior. These complex neurological diseases are among the fastest growing developmental disorders and cause varying degrees of lifelong disabilities. There have been a lot of ongoing research to unravel the pathogenic mechanism of autism. Computational methods have come to the scene as a promising approach to aid the physicians in studying autism. In this paper, we present an efficient method to predict autism-related candidate genes (autism genes in short) by integrating protein interaction network and miRNA-target interaction network. We combine the two networks by a new technique relying on shortest path calculation. To demonstrate the high performance of our method, we run several experiments on three different PPI networks extracted from the BioGRID database, the HINT database, and the HPRD database. Three supervised learning algorithms were employed, i.e., the Bayesian network and the random tree and the random forest. Among them, the random forest method performs better in terms of precision, recall, and F-measure. It shows that the random forest algorithmis potential to infer autism genes. Carrying out the experiments with five different lengths of the shortest paths in the PPI networks, the results show the advantage of the method in studying autism genes based on the large scale network. In conclusion, the proposed method is beneficial in deciphering the pathogenic mechanism of autism.
引用
收藏
页码:299 / 311
页数:13
相关论文
共 29 条
[1]   Speeding disease gene discovery by sequence based candidate prioritization [J].
Adie, EA ;
Adams, RR ;
Evans, KL ;
Porteous, DJ ;
Pickard, BS .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]  
Barrett Tanya, 2006, V338, P175
[3]   MicroRNAs: Genomics, biogenesis, mechanism, and function (Reprinted from Cell, vol 116, pg 281-297, 2004) [J].
Bartel, David P. .
CELL, 2007, 131 (04) :11-29
[4]  
Borgwardt KM, 2007, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2007, P4
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Chan A.W.S., 2012, FRONT GENET, V82, P1
[7]  
Chatr-aryamontri A., 2012, NUCL ACIDS RES
[8]   HINT: High-quality protein interactomes and their applications in understanding human disease [J].
Das, Jishnu ;
Yu, Haiyuan .
BMC SYSTEMS BIOLOGY, 2012, 6
[9]   The Comparative Toxicogenomics Database: update 2013 [J].
Davis, Allan Peter ;
Murphy, Cynthia Grondin ;
Johnson, Robin ;
Lay, Jean M. ;
Lennon-Hopkins, Kelley ;
Saraceni-Richards, Cynthia ;
Sciaky, Daniela ;
King, Benjamin L. ;
Rosenstein, Michael C. ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D1104-D1114
[10]   The human disease network [J].
Goh, Kwang-Il ;
Cusick, Michael E. ;
Valle, David ;
Childs, Barton ;
Vidal, Marc ;
Barabasi, Albert-Laszlo .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (21) :8685-8690