Annotation Enrichment Analysis: An Alternative Method for Evaluating the Functional Properties of Gene Sets

被引:53
作者
Glass, Kimberly [1 ,2 ,3 ]
Girvan, Michelle [3 ,4 ,5 ]
机构
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[3] Univ Maryland, Dept Phys, College Pk, MD 20742 USA
[4] Univ Maryland, Inst Phys Sci & Technol, College Pk, MD 20742 USA
[5] Santa Fe Inst, Santa Fe, NM 87501 USA
关键词
T-CELL-LYMPHOMA; BREAST-CANCER; EXPRESSION PROFILES; DIFFERENTIAL EXPRESSION; MOLECULAR SIGNATURE; CURATED DATABASE; P53; STATUS; PROGNOSIS; PATTERNS; PROLIFERATION;
D O I
10.1038/srep04191
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene annotation databases (compendiums maintained by the scientific community that describe the biological functions performed by individual genes) are commonly used to evaluate the functional properties of experimentally derived gene sets. Overlap statistics, such as Fishers Exact test (FET), are often employed to assess these associations, but don't account for non-uniformity in the number of genes annotated to individual functions or the number of functions associated with individual genes. We find FET is strongly biased toward over-estimating overlap significance if a gene set has an unusually high number of annotations. To correct for these biases, we develop Annotation Enrichment Analysis (AEA), which properly accounts for the non-uniformity of annotations. We show that AEA is able to identify biologically meaningful functional enrichments that are obscured by numerous false-positive enrichment scores in FET, and we therefore suggest it be used to more accurately assess the biological properties of gene sets.
引用
收藏
页数:9
相关论文
共 73 条
[1]   Improved scoring of functional groups from gene expression data by decorrelating GO graph structure [J].
Alexa, Adrian ;
Rahnenfuehrer, Joerg ;
Lengauer, Thomas .
BIOINFORMATICS, 2006, 22 (13) :1600-1607
[2]  
Ashburner M, 2001, GENOME RES, V11, P1425, DOI 10.1101/gr.180801
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   Gene expression patterns that predict sensitivity to epidermal growth factor receptor tyrosine kinase inhibitors in lung cancer cell lines and human lung tumors [J].
Balko, Justin M. ;
Potti, Anil ;
Saunders, Christopher ;
Stromberg, Arnold ;
Haura, Eric B. ;
Black, Esther P. .
BMC GENOMICS, 2006, 7 (1)
[5]   GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[6]   The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium [J].
Berardini, Tanya Z. ;
Li, Donghui ;
Huala, Eva ;
Bridges, Susan ;
Burgess, Shane ;
McCarthy, Fiona ;
Carbon, Seth ;
Lewis, Suzanna E. ;
Mungall, Christopher J. ;
Abdulla, Amina ;
Wood, Valerie ;
Feltrin, Erika ;
Valle, Giorgio ;
Chisholm, Rex L. ;
Fey, Petra ;
Gaudet, Pascale ;
Kibbe, Warren ;
Basu, Siddhartha ;
Bushmanova, Yulia ;
Eilbeck, Karen ;
Siegele, Deborah A. ;
McIntosh, Brenley ;
Renfro, Daniel ;
Zweifel, Adrienne ;
Hu, James C. ;
Ashburner, Michael ;
Tweedie, Susan ;
Alam-Faruque, Yasmin ;
Apweiler, Rolf ;
Auchinchloss, Andrea ;
Bairoch, Amos ;
Barrell, Daniel ;
Binns, David ;
Blatter, Marie-Claude ;
Bougueleret, Lydie ;
Boutet, Emmanuel ;
Breuza, Lionel ;
Bridge, Alan ;
Browne, Paul ;
Chan, Wei Mun ;
Coudert, Elizabeth ;
Daugherty, Louise ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Estreicher, Anne ;
Famiglietti, Livia ;
Ferro-Rojas, Serenella ;
Feuermann, Marc ;
Foulger, Rebecca ;
Gruaz-Gumowski, Nadine .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D331-D335
[7]   Extracellular matrix signature identifies breast cancer subgroups with different clinical outcome [J].
Bergamaschi, A. ;
Tagliabue, E. ;
Sorlie, T. ;
Naurne, B. ;
Triulzi, T. ;
Orlandi, R. ;
Russnes, H. G. ;
Nesland, J. M. ;
Tammi, R. ;
Auvinen, P. ;
Kosma, V-M ;
Menard, S. ;
Borresen-Dale, A-L .
JOURNAL OF PATHOLOGY, 2008, 214 (03) :357-367
[8]   Prognosis of breast cancer and gene expression profiling using DNA arrays [J].
Bertucci, F ;
Houlgatte, R ;
Granjeaud, S ;
Nasser, V ;
Loriod, B ;
Beaudoing, E ;
Hingamp, P ;
Jacquemier, J ;
Viens, P ;
Birnbaum, D ;
Nguyen, C .
MICROARRAYS, IMMUNE RESPONSES AND VACCINES, 2002, 975 :217-231
[9]   Up regulated expression of tumour necrosis factor α converting enzyme in peripheral monocytes of patients with early systemic sclerosis [J].
Bohgaki, T ;
Amasaki, Y ;
Nishimura, N ;
Bohgaki, M ;
Yamashita, Y ;
Nishio, M ;
Sawada, K ;
Jodo, S ;
Atsumi, T ;
Koike, T .
ANNALS OF THE RHEUMATIC DISEASES, 2005, 64 (08) :1165-1173
[10]   Hepatic Stem-like Phenotype and Interplay of Wnt/β-Catenin and Myc Signaling in Aggressive Childhood Liver Cancer [J].
Cairo, Stefano ;
Armengol, Carolina ;
De Reynies, Auelien ;
Wei, Yu ;
Thomas, Emilie ;
Renard, Claire-Angelique ;
Goga, Andrei ;
Balakrishnan, Asha ;
Semeraro, Michaela ;
Gresh, Lionel ;
Pontoglio, Marco ;
Strick-Marchand, Helene ;
Levillayer, Florence ;
Nouet, Yann ;
Rickman, David ;
Gauthier, Frederic ;
Branchereau, Sophie ;
Brugieres, Laurence ;
Laithier, Veronique ;
Bouvier, Raymonde ;
Boman, Francoise ;
Basso, Giuseppe ;
Michiels, Jean-Francois ;
Hofman, Paul ;
Arbez-Gindre, Francine ;
Jouan, Helene ;
Chapeau, Marie-Christine Rousselet ;
Berrebi, Dominique ;
Marcellin, Luc ;
Plenat, Francois ;
Zachar, Dominique ;
Joubert, Madeleine ;
Selves, Janick ;
Pasquier, Dominique ;
Bioulac-Sage, Paulette ;
Grotzer, Michael ;
Childs, Margaret ;
Fabre, Monique ;
Buendia, Marie-Annick .
CANCER CELL, 2008, 14 (06) :471-484