An information theoretic approach to assessing Gene-Ontology-driven similarity and its application

被引:0
作者
Wang, Haiying [1 ]
Azuaje, Francisco [2 ]
Zheng, Huiru [1 ]
机构
[1] Univ Ulster, Sch Comp & Math, Comp Sci Res Inst, Belfast, Antrim, North Ireland
[2] Publ Res Ctr Hlth CRP Sante, Cardiovasc Res Lab, L-1150 Luxembourg, Luxembourg
关键词
semantic similarity; GO; gene ontology; information content;
D O I
10.1504/IJDMB.2014.059061
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Using information-theoretic approaches, this paper presents a cross-platform system to support the integration of Gene Ontology (GO)-driven similarity knowledge into functional genomics. Three GO-driven similarity measures (Resnik's, Lin's and Jiang's metrics) have been implemented to measure between-term similarity within each of the GO hierarchies. Two approaches (simple and highest average similarity) which are based on the aggregation of between-term similarities, are used to estimate the similarity between gene products. The system has been successfully applied to a number of applications including assessing gene expression correlation patterns and the relationships between GO-driven similarity and other functional properties.
引用
收藏
页码:121 / 134
页数:14
相关论文
共 25 条
[1]  
[Anonymous], 2005, P ISMB 2005 SIG M BI
[2]  
Ashburner M, 2001, GENOME RES, V11, P1425, DOI 10.1101/gr.180801
[3]   Incorporating ontology-driven similarity knowledge into functional genomics: An exploratory study [J].
Azuaje, F ;
Bodenreider, O .
BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, :317-324
[4]  
Azuaje F, 2006, ICDM 2006: Sixth IEEE International Conference on Data Mining, Workshops, P114
[5]   MGD: the Mouse Genome Database [J].
Blake, JA ;
Richardson, JE ;
Bult, RJ ;
Kadin, JA ;
Eppig, JT .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :193-195
[6]   A knowledge-driven probabilistic framework for the prediction of protein-protein interaction networks [J].
Browne, Fiona ;
Wang, Haiying ;
Zheng, Huiru ;
Azuaje, Francisco .
COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (03) :306-317
[7]   Semantic similarity based feature extraction from microarray expression data [J].
Cho, Young-Rae ;
Zhang, Aidong ;
Xu, Xian .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (03) :333-345
[8]   Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO) [J].
Dwight, SS ;
Harris, MA ;
Dolinski, K ;
Ball, CA ;
Binkley, G ;
Christie, KR ;
Fisk, DG ;
Issel-Tarver, L ;
Schroeder, M ;
Sherlock, G ;
Sethuraman, A ;
Weng, S ;
Botstein, D ;
Cherry, JM .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :69-72
[9]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[10]   The FlyBase database of the Drosophila genome projects and community literature [J].
Gelbart, W ;
Bayraktaroglu, L ;
Bettencourt, B ;
Campbell, K ;
Crosby, M ;
Emmert, D ;
Hradecky, P ;
Huang, Y ;
Letovsky, S ;
Matthews, B ;
Russo, S ;
Schroeder, A ;
Smutniak, F ;
Zhou, P ;
Zytkovicz, M ;
Ashburner, M ;
Drysdale, R ;
de Grey, A ;
Foulger, R ;
Millburn, G ;
Yamada, C ;
Kaufman, T ;
Matthews, K ;
Gilbert, D ;
Grumbling, G ;
Strelets, V ;
Shemen, C ;
Rubin, G ;
Berman, B ;
Frise, E ;
Gibson, M ;
Harris, N ;
Kaminker, J ;
Lewis, S ;
Marshall, B ;
Misra, S ;
Mungall, C ;
Prochnik, S ;
Richter, J ;
Smith, C ;
Shu, S ;
Tupy, J ;
Wiel, C .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :172-175