Evaluation of GO-based functional similarity measures using S. cerevisiae protein interaction and expression profile data

被引:66
作者
Xu, Tao [1 ,2 ]
Du, LinFang [1 ]
Zhou, Yan [2 ,3 ]
机构
[1] Sichuan Univ, Coll Life Sci, Chengdu 610064, Peoples R China
[2] Chinese Natl Human Genome Ctr Shanghai, Shanghai MOST Key Lab Hlth & Dis Genom, Shanghai 201203, Peoples R China
[3] Fudan Univ, Sch Life Sci, Dept Microbiol, Shanghai 200433, Peoples R China
关键词
D O I
10.1186/1471-2105-9-472
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Researchers interested in analysing the expression patterns of functionally related genes usually hope to improve the accuracy of their results beyond the boundaries of currently available experimental data. Gene ontology (GO) data provides a novel way to measure the functional relationship between gene products. Many approaches have been reported for calculating the similarities between two GO terms, known as semantic similarities. However, biologists are more interested in the relationship between gene products than in the scores linking the GO terms. To highlight the relationships among genes, recent studies have focused on functional similarities. Results: In this study, we evaluated five functional similarity methods using both protein-protein interaction (PPI) and expression data of S. cerevisiae. The receiver operating characteristics (ROC) and correlation coefficient analysis of these methods showed that the maximum method outperformed the other methods. Statistical comparison of multiple-and single-term annotated proteins in biological process ontology indicated that genes with multiple GO terms may be more reliable for separating true positives from noise. Conclusion: This study demonstrated the reliability of current approaches that elevate the similarity of GO terms to the similarity of proteins. Suggestions for further improvements in functional similarity analysis are also provided.
引用
收藏
页数:10
相关论文
共 33 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces [J].
Aytuna, AS ;
Gursoy, A ;
Keskin, O .
BIOINFORMATICS, 2005, 21 (12) :2850-2855
[3]  
Azuaje F, 2006, METH MOL B, V316, P67
[4]   Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps [J].
Brameier, Markus ;
Wiuf, Carsten .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (02) :160-173
[5]  
Budanitsky A., 2001, WORKSH WORDNET OTH L
[6]  
Cheng Jill, 2004, J Biopharm Stat, V14, P687, DOI 10.1081/BIP-200025659
[7]   The transcriptional program of sporulation in budding yeast [J].
Chu, S ;
DeRisi, J ;
Eisen, M ;
Mulholland, J ;
Botstein, D ;
Brown, PO ;
Herskowitz, I .
SCIENCE, 1998, 282 (5389) :699-705
[8]   The Nuclear Protein Database (NPD): sub-nuclear localisation and functional annotation of the nuclear proteome [J].
Dellaire, G ;
Farrall, R ;
Bickmore, WA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :328-330
[9]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[10]   CYGD:: the comprehensive Yeast Genome Database [J].
Güldener, U ;
Münsterkötter, M ;
Kastenmüller, G ;
Strack, N ;
van Helden, J ;
Lemer, C ;
Richelles, J ;
Wodak, SJ ;
García-Martínez, J ;
Pérez-Ortín, JE ;
Michael, H ;
Kaps, A ;
Talla, E ;
Dujon, B ;
André, B ;
Souciet, JL ;
De Montigny, J ;
Bon, E ;
Gaillardin, C ;
Mewes, HW .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D364-D368