Independent component analysis: Mining microarray data for fundamental human gene expression modules

Cited by: 65
Authors
Engreitz, Jesse M. [1]
Daigle, Bernie J., Jr. [2]
Marshall, Jonathan J. [1]
Altman, Russ B. [1, 2]
Affiliations
[1] Stanford Univ, Dept Bioengn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Genet, Sch Med, Stanford, CA 94305 USA
Keywords
Microarrays; Independent component analysis; Data mining; Parthenolide; Gene modules; SESQUITERPENE LACTONE PARTHENOLIDE; NF-KAPPA-B; ACUTE MYELOGENOUS LEUKEMIA; ACUTE MYELOID-LEUKEMIA; TRANSCRIPTION FACTOR; STEM-CELLS; PROSTATE-CANCER; HUMAN GENOME; APOPTOSIS; PROFILES;
DOI
10.1016/j.jbi.2010.07.001
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
As public microarray repositories rapidly accumulate gene expression data, these resources contain increasingly valuable information about cellular processes in human biology. This presents a unique opportunity for intelligent data mining methods to extract information about the transcriptional modules underlying these biological processes. Modeling cellular gene expression as a combination of functional modules, we use independent component analysis (ICA) to derive 423 fundamental components of human biology from a 9395-array compendium of heterogeneous expression data. Annotation using the Gene Ontology (GO) suggests that while some of these components represent known biological modules, others may describe biology not well characterized by existing manually-curated ontologies. In order to understand the biological functions represented by these modules, we investigate the mechanism of the preclinical anti-cancer drug parthenolide (PTL) by analyzing the differential expression of our fundamental components. Our method correctly identifies known pathways and predicts that N-glycan biosynthesis and T-cell receptor signaling may contribute to PTL response. The fundamental gene modules we describe have the potential to provide pathway-level insight into new gene expression datasets. (C) 2010 Elsevier Inc. All rights reserved.
Pages: 932-944
Page count: 13
Related papers
50 in total
[31]   Microarray data classification based on ensemble independent component selection [J].
Liu, Kun-Hong ;
Li, Bo ;
Wu, Qing-Qiang ;
Zhang, Jun ;
Du, Ji-Xiang ;
Liu, Guo-Yan .
COMPUTERS IN BIOLOGY AND MEDICINE, 2009, 39 (11) :953-960
[32]   Integrating the Principal Component Analysis with Partial Decision Tree in Microarray Gene Data [J].
Al-Batah, Mohammad Subhi .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (03) :24-29
[33]   Application of Transcriptional Gene Modules to Analysis of Caenorhabditis elegans' Gene Expression Data [J].
Cary, Michael ;
Podshivalova, Katie ;
Kenyon, Cynthia .
G3-GENES GENOMES GENETICS, 2020, 10 (10) :3623-3638
[34]   Comparative Analysis of Data Mining Algorithms for Cancer Gene Expression Data [J].
Thareja, Preeti ;
Chhillar, Rajender Singh .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (10) :322-328
[35]   Gene expression profile analysis of pancreatic cancer based on microarray data [J].
Long, Jin ;
Liu, Zhe ;
Wu, Xingda ;
Xu, Yuanhong ;
Ge, Chunlin .
MOLECULAR MEDICINE REPORTS, 2016, 13 (05) :3913-3919
[36]   Analyzing time-dependent microarray data using independent component analysis derived expression modes from human macrophages infected with F. tularensis holartica [J].
Lutter, D. ;
Langmann, Th. ;
Ugocsai, P. ;
Moehle, C. ;
Seibold, E. ;
Splettstoesser, W. D. ;
Gruber, P. ;
Lang, E. W. ;
Schmitz, G. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (04) :605-611
[37]   Vector Quantization of Microarray Gene Expression Data [J].
Prasad, T. V. ;
Kohli, Maitrei .
WORLD CONGRESS ON ENGINEERING, WCE 2010, VOL I, 2010, :231-235
[38]   Mining yeast gene microarray data with latent variable models [J].
Staiano, Antonino ;
Tagliaferri, Roberto ;
De Vinco, Lara ;
Ciaramella, Angelo ;
Raiconi, Giancarlo ;
Longo, Giuseppe ;
Miele, Gennaro ;
Amato, Roberto ;
Del Mondo, Carmine ;
Donalek, Ciro ;
Mangano, Gianpiero ;
Di Bernardo, Diego .
Biological and Artificial Intelligence Environments, 2005, :81-89
[39]   Integrated analysis of DNA copy number and gene expression microarray data using gene sets [J].
Menezes, Renee X. ;
Boetzer, Marten ;
Sieswerda, Melle ;
van Ommen, Gert-Jan B. ;
Boer, Judith M. .
BMC BIOINFORMATICS, 2009, 10
[40]   Identification of novel human glioblastoma-specific transcripts by serial analysis of gene expression data mining [J].
Su, Yanlin ;
Xiong, Jie ;
Bing, Zhitong ;
Zeng, Xiaomin ;
Zhang, Yong ;
Fu, Xiaohua ;
Peng, Xiaoning .
CANCER BIOMARKERS, 2013, 13 (05) :367-375