An improved method for functional similarity analysis of genes based on Gene Ontology

被引:22
作者
Tian, Zhen [1 ]
Wang, Chunyu [1 ]
Guo, Maozu [1 ]
Liu, Xiaoyan [1 ]
Teng, Zhixia [1 ,2 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
[2] Northeast Forestry Univ, Dept Informat Management & Informat Syst, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Gene Ontology; Specificity of terms; Weighted inherited semantics; Gene functional similarity; SEMANTIC SIMILARITY; MICROARRAY DATA; EXPRESSION; TERMS; NETWORKS; CLUSTERS; TAXONOMY; FEATURES; MODULES;
D O I
10.1186/s12918-016-0359-z
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Measures of gene functional similarity are essential tools for gene clustering, gene function prediction, evaluation of protein-protein interaction, disease gene prioritization and other applications. In recent years, many gene functional similarity methods have been proposed based on the semantic similarity of GO terms. However, these leading approaches may make errorprone judgments especially when they measure the specificity of GO terms as well as the IC of a term set. Therefore, how to estimate the gene functional similarity reliably is still a challenging problem. Results: We propose WIS, an effective method to measure the gene functional similarity. First of all, WIS computes the IC of a term by employing its depth, the number of its ancestors as well as the topology of its descendants in the GO graph. Secondly, WIS calculates the IC of a term set by means of considering the weighted inherited semantics of terms. Finally, WIS estimates the gene functional similarity based on the IC overlap ratio of term sets. WIS is superior to some other representative measures on the experiments of functional classification of genes in a biological pathway, collaborative evaluation of GO-based semantic similarity measures, protein-protein interaction prediction and correlation with gene expression. Further analysis suggests that WIS takes fully into account the specificity of terms and the weighted inherited semantics of terms between GO terms. Conclusions: The proposed WIS method is an effective and reliable way to compare gene function. The web service of WIS is freely available at http://nclab.hit.edu.cn/WIS/.
引用
收藏
页数:20
相关论文
共 60 条
[1]  
[Anonymous], 2002, Proceedings of the 19th International Conference on Computational Linguistics, DOI [10.3115/1072228.1072318, DOI 10.3115/1072228.1072318]
[2]  
[Anonymous], 1998, An information-theoretic definition of similarity
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   The universal protein resource (UniProt) [J].
Bairoch, Amos ;
Bougueleret, Lydie ;
Altairac, Severine ;
Amendolia, Valeria ;
Auchincloss, Andrea ;
Puy, Ghislaine Argoud ;
Axelsen, Kristian ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte ;
Bollondi, Laurent ;
Boutet, Emmanuel ;
Quintaje, Silvia Braconi ;
Breuza, Lionel ;
Bridge, Alan ;
deCastro, Edouard ;
Coral, Danielle ;
Coudert, Elisabeth ;
Cusin, Isabelle ;
Dobrokhotov, Pavel ;
Dornevil, Dolnide ;
Duvaud, Severine ;
Estreicher, Anne ;
Famiglietti, Livia ;
Feuermann, Marc ;
Gehant, Sebastian ;
Farriol-Mathis, Nathalie ;
Ferro, Serenella ;
Gasteiger, Elisabeth ;
Gateau, Alain ;
Gerritsen, Vivienne ;
Gos, Arnaud ;
Gruaz-Gumowski, Nadine ;
Hinz, Ursula ;
Hulo, Chantal ;
Hulo, Nicolas ;
Ioannidis, Vassilios ;
Ivanyi, Ivan ;
James, Janet ;
Jain, Eric ;
Jimenez, Silvia ;
Jungo, Florence ;
Junker, Vivien ;
Keller, Guillaume ;
Lachaize, Corinne ;
Lane-Guermonprez, Lydie ;
Langendijk-Genevaux, Petra ;
Lara, Vicente ;
Lemercier, Philippe ;
Le Saux, Virginie .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D193-D197
[5]   A New Path Based Hybrid Measure for Gene Ontology Similarity [J].
Bandyopadhyay, Sanghamitra ;
Mallick, Koushik .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (01) :116-127
[6]   Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps [J].
Brameier, Markus ;
Wiuf, Carsten .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (02) :160-173
[7]   A transversal approach to predict gene product networks from ontology-based similarity [J].
Chabalier, Julie ;
Mosser, Jean ;
Burgun, Anita .
BMC BIOINFORMATICS, 2007, 8 (1)
[8]   Disease candidate gene identification and prioritization using protein interaction networks [J].
Chen, Jing ;
Aronow, Bruce J. ;
Jegga, Anil G. .
BMC BIOINFORMATICS, 2009, 10
[9]  
Cheng Jill, 2004, J Biopharm Stat, V14, P687, DOI 10.1081/BIP-200025659
[10]  
Cheng L, 2014, SEMFUNSIM NEW METHOD