Large-scale integration of microarray data reveals genes and pathways common to multiple cancer types

被引:37
作者
Dawany, Noor B. [1 ]
Dampier, Will N. [1 ]
Tozeren, Aydin [1 ]
机构
[1] Drexel Univ, Ctr Integrated Bioinformat, Sch Biomed Engn Sci & Hlth Syst, Philadelphia, PA 19104 USA
关键词
cancer; microarray; large-scale; meta-analysis; EXPRESSION PROFILES; METAANALYSIS; DISCOVERY;
D O I
10.1002/ijc.25854
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
The global gene expression analysis of cancer and healthy tissues typically results in large numbers of genes that are significantly altered in cancer. Such data, however, has been difficult to interpret due to the high level of variation of gene lists across laboratories and the small sample sizes used in individual studies. In this investigation, we compiled microarray data obtained from the same platform family from 84 laboratories, resulting in a database containing 1,043 healthy tissue samples and 4,900 cancer samples for 13 different tissue types. The primary cancers considered included adrenal gland, brain, breast, cervix, colon, kidney, liver, lung, ovary, pancreas, prostate and skin tissues. We normalized the data together and analyzed subsets for the discovery of genes involved in normal to cancer transformation. Our integrated significance analysis of microarrays approach produced top 400 gene lists for each of the 13 cancer types. These lists were highly statistically enriched with genes already associated with cancer in research publications excluding microarray studies (p < 1.31 E - 12). The genes MTIM and RRM2 appeared in nine and TOP2A in eight lists of significantly altered genes in cancer. In total, there were 132 genes present in at least four gene lists, 11 of which were not previously associated with cancer. The list contains 17 metal ions and 15 adenyl ribonucleotide binding proteins, six kinases and six transcription factors. Our results point to the value of integrating microarray data in the study of combination drug therapies targeting metastasis.
引用
收藏
页码:2881 / 2891
页数:11
相关论文
共 29 条
[1]   NCBI GEO: mining tens of millions of expression profiles - database and tools update [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D760-D765
[2]   A-MADMAN: Annotation-based microarray data meta-analysis tool [J].
Bisognin, Andrea ;
Coppe, Alessandro ;
Ferrari, Francesco ;
Risso, Davide ;
Romualdi, Chiara ;
Bicciato, Silvio ;
Bortoluzzi, Stefania .
BMC BIOINFORMATICS, 2009, 10
[3]   ArrayExpress - a public repository for microarray gene expression data at the EBI [J].
Brazma, A ;
Parkinson, H ;
Sarkans, U ;
Shojatalab, M ;
Vilo, J ;
Abeygunawardena, N ;
Holloway, E ;
Kapushesky, M ;
Kemmeren, P ;
Lara, GG ;
Oezcimen, A ;
Rocca-Serra, P ;
Sansone, SA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :68-71
[4]   Integrative analysis of multiple gene expression profiles applied to liver cancer study [J].
Choi, JK ;
Choi, JY ;
Kim, DG ;
Choi, DW ;
Kim, BY ;
Lee, KH ;
Yeom, YI ;
Yoo, HS ;
Yoo, OJ ;
Kim, S .
FEBS LETTERS, 2004, 565 (1-3) :93-100
[5]   Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data [J].
Dai, MH ;
Wang, PL ;
Boyd, AD ;
Kostov, G ;
Athey, B ;
Jones, EG ;
Bunney, WE ;
Myers, RM ;
Speed, TP ;
Akil, H ;
Watson, SJ ;
Meng, F .
NUCLEIC ACIDS RESEARCH, 2005, 33 (20) :e175.1-e175.9
[6]   Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types [J].
Dawany, Noor B. ;
Tozeren, Aydin .
BMC BIOINFORMATICS, 2010, 11
[7]  
DeConde R, 2006, STAT APPL GENET MOL, V5
[8]   DAVID: Database for annotation, visualization, and integrated discovery [J].
Dennis, G ;
Sherman, BT ;
Hosack, DA ;
Yang, J ;
Gao, W ;
Lane, HC ;
Lempicki, RA .
GENOME BIOLOGY, 2003, 4 (09)
[9]   Gene Expression Omnibus: NCBI gene expression and hybridization array data repository [J].
Edgar, R ;
Domrachev, M ;
Lash, AE .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :207-210
[10]   Candidate pathways and genes for prostate cancer: a meta-analysis of gene expression data [J].
Gorlov, Ivan P. ;
Byun, Jinyoung ;
Gorlova, Olga Y. ;
Aparicio, Ana M. ;
Efstathiou, Eleni ;
Logothetis, Christopher J. .
BMC MEDICAL GENOMICS, 2009, 2