Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data

被引:9
|
作者
Wilson, Tyler J. [1 ]
Lai, Liming [1 ]
Ban, Yuguang [1 ]
Ge, Steven X. [1 ]
机构
[1] S Dakota State Univ, Dept Math & Stat, Brookings, SD 57007 USA
来源
BMC GENOMICS | 2012年 / 13卷
基金
美国国家卫生研究院;
关键词
BIOINFORMATICS; DISCOVERY; NETWORK; REVEALS; BIOLOGY; TOOLS; LISTS;
D O I
10.1186/1471-2164-13-237
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Many plant genes have been identified through whole genome and deep transcriptome sequencing and other methods; yet our knowledge on the function of many of these genes remains limited. The integration and analysis of large gene-expression datasets gives researchers the ability to formalize hypotheses concerning the functionality and interaction between different groups of correlated genes. Results: We applied the non-negative matrix factorization (NMF) algorithm to the AtGenExpress dataset which consists of 783 microarray samples (29 separate experimental series) conducted on the model plant Arabidopsis thaliana. We identified 15 metagenes, which are groups of genes with correlated expression. Functional roles of these metagenes are established by observing the enriched gene ontology (GO) categories using gene set enrichment analyses (GSEA). Activity levels of these metagenes in various experimental conditions are also analyzed to associate metagenes with stimuli/conditions. A metagene correlation network, constructed based on the results of NMF analysis, revealed many new interactions between the metagenes. Comparison of these metagenes with an earlier large-scale clustering analysis indicates many statistically significant overlaps. Conclusions: This study identifies a network of correlated metagenes composed of Arabidopsis genes acting in a highly correlated fashion across a broad spectrum of experimental stimuli, which may shed some light on the function of many of the un-annotated genes.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Identification of metagenes and their Interactions through Large-scale Analysis of Arabidopsis Gene Expression Data
    Tyler J Wilson
    Liming Lai
    Yuguang Ban
    Steven X Ge
    BMC Genomics, 13
  • [2] Subsystem identification through dimensionality reduction of large-scale gene expression data
    Kim, PM
    Tidor, B
    GENOME RESEARCH, 2003, 13 (07) : 1706 - 1718
  • [3] Analysis of large-scale gene expression data
    Sherlock, G
    CURRENT OPINION IN IMMUNOLOGY, 2000, 12 (02) : 201 - 205
  • [4] Finding regulatory modules through large-scale gene-expression data analysis
    Kloster, M
    Tang, C
    Wingreen, NS
    BIOINFORMATICS, 2005, 21 (07) : 1172 - 1179
  • [5] Challenges and prospects in the analysis of large-scale gene expression data
    Ihmeis, JH
    Bergmann, S
    BRIEFINGS IN BIOINFORMATICS, 2004, 5 (04) : 313 - 327
  • [6] Exploiting Scientific Workflows for Large-scale Gene Expression Data Analysis
    De Stasio, Alessandro
    Ertelt, Marcus
    Kemmner, Wolfgang
    Leser, Ulf
    Ceccarelli, Michele
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 447 - +
  • [7] Iterative signature algorithm for the analysis of large-scale gene expression data
    Bergmann, S
    Ihmels, J
    Barkai, N
    PHYSICAL REVIEW E, 2003, 67 (03):
  • [8] Large-scale gene expression data clustering through incremental ensemble approach
    Khan, Imran
    Shaikh, Abdul Khalique
    Adhikari, Naresh
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (04):
  • [9] Large-scale analysis of the GRAS gene family in Arabidopsis thaliana
    Mi-Hyun Lee
    Bohye Kim
    Sang-Kee Song
    Jung-Ok Heo
    Nan-Ie Yu
    Shin Ae Lee
    Miran Kim
    Dong Gwan Kim
    Sung Oh Sohn
    Chae Eun Lim
    Kwang Suk Chang
    Myeong Min Lee
    Jun Lim
    Plant Molecular Biology, 2008, 67 : 659 - 670
  • [10] Large-Scale Identification and Analysis of Suppressive Drug Interactions
    Coko, Murat
    Weinstein, Zohar B.
    Yilancioglu, Kaan
    Tasan, Murat
    Doak, Allison
    Cansever, Dilay
    Mutlu, Beste
    Li, Siyang
    Rodriguez-Esteban, Raul
    Akhmedov, Murodzhon
    Guvenek, Aysegul
    Cokol, Melike
    Cetiner, Selim
    Giaever, Guri
    Iossifov, Ivan
    Nislow, Corey
    Shoichet, Brian
    Roth, Frederick P.
    CHEMISTRY & BIOLOGY, 2014, 21 (04): : 541 - 551