Repeatability of published microarray gene expression analyses

被引:349
作者
Ioannidis, John P. A. [1 ,2 ,3 ,4 ]
Allison, David B. [5 ]
Ball, Catherine A. [6 ]
Coulibaly, Issa [5 ]
Cui, Xiangqin [5 ]
Culhane, Aedin C. [7 ,8 ]
Falchi, Mario [9 ,10 ]
Furlanello, Cesare [11 ]
Game, Laurence [12 ]
Jurman, Giuseppe [11 ]
Mangion, Jon [12 ]
Mehta, Tapan [5 ]
Nitzberg, Michael [6 ]
Page, Grier P. [5 ,13 ]
Petretto, Enrico [12 ,14 ]
van Noort, Vera [15 ]
机构
[1] Univ Ioannina, Sch Med, Clin & Mol Epidemiol Unit, Dept Hyg & Epidemiol, GR-45110 Ioannina, Greece
[2] Fdn Res & Technol Hellas, Biomed Res Inst, Ioannina 45110, Greece
[3] Tufts Univ, Sch Med, Ctr Genet Epidemiol & Modeling, Tufts Med Ctr, Boston, MA 02111 USA
[4] Tufts Univ, Sch Med, Dept Med, Boston, MA 02111 USA
[5] Univ Alabama, Dept Biostat, Sect Stat Genet, Birmingham, AL 35294 USA
[6] Stanford Univ, Dept Biochem, Sch Med, Stanford, CA 94305 USA
[7] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
[8] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[9] Univ London Imperial Coll Sci Technol & Med, Hammersmith Hosp, Fac Med, London W12 0NN, England
[10] Kings Coll London, Dept Twin Res & Genet Epidemiol, London SE1 7EH, England
[11] Fdn Bruno Kessler, I-38100 Trento, Italy
[12] Hammersmith Hosp, MRC, Ctr Clin Sci, Microarray Ctr, London W12 0NN, England
[13] RTI Int, Stat & Epidemiol Unit, Atlanta, GA 30341 USA
[14] Univ London Imperial Coll Sci Technol & Med, Fac Med, Dept Epidemiol Publ Hlth & Primary Care, London W2 1PG, England
[15] European Mol Biol Lab, D-69117 Heidelberg, Germany
基金
英国医学研究理事会;
关键词
GENOME-WIDE ANALYSIS; REPRODUCIBILITY; ASSOCIATIONS; ARRAYEXPRESS; INFORMATION; REPOSITORY; SIGNATURE; MIAME; CELLS;
D O I
10.1038/ng.295
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Given the complexity of microarray-based gene expression studies, guidelines encourage transparent design and public data availability. Several journals require public data deposition and several public databases exist. However, not all data are publicly available, and even when available, it is unknown whether the published results are reproducible by independent scientists. Here we evaluated the replication of data analyses in 18 articles on microarray-based gene expression profiling published in Nature Genetics in 2005-2006. One table or figure from each article was independently evaluated by two teams of analysts. We reproduced two analyses in principle and six partially or with some discrepancies; ten could not be reproduced. The main reason for failure to reproduce was data unavailability, and discrepancies were mostly due to incomplete data annotation or specification of data processing and analysis. Repeatability of published microarray studies is apparently limited. More strict publication rules enforcing public data availability and explicit description of data processing and analysis should be considered.
引用
收藏
页码:149 / 155
页数:7
相关论文
共 42 条
  • [1] Microarray data analysis: from disarray to consolidation and consensus
    Allison, DB
    Cui, XQ
    Page, GP
    Sabripour, M
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (01) : 55 - 65
  • [2] Submission of microarray data to public repositories
    Ball, CA
    Brazma, A
    Causton, H
    Chervitz, S
    Edgar, R
    Hingamp, P
    Matese, JC
    Parkinson, H
    Quackenbush, J
    Ringwald, M
    Sansone, SA
    Sherlock, G
    Spellman, P
    Stoeckert, C
    Tateno, Y
    Taylor, R
    White, J
    Winegarden, N
    [J]. PLOS BIOLOGY, 2004, 2 (09) : 1276 - 1277
  • [3] Minimum information about a microarray experiment (MIAME) - toward standards for microarray data
    Brazma, A
    Hingamp, P
    Quackenbush, J
    Sherlock, G
    Spellman, P
    Stoeckert, C
    Aach, J
    Ansorge, W
    Ball, CA
    Causton, HC
    Gaasterland, T
    Glenisson, P
    Holstege, FCP
    Kim, IF
    Markowitz, V
    Matese, JC
    Parkinson, H
    Robinson, A
    Sarkans, U
    Schulze-Kremer, S
    Stewart, J
    Taylor, R
    Vilo, J
    Vingron, M
    [J]. NATURE GENETICS, 2001, 29 (04) : 365 - 371
  • [4] ArrayExpress - a public repository for microarray gene expression data at the EBI
    Brazma, A
    Parkinson, H
    Sarkans, U
    Shojatalab, M
    Vilo, J
    Abeygunawardena, N
    Holloway, E
    Kapushesky, M
    Kemmeren, P
    Lara, GG
    Oezcimen, A
    Rocca-Serra, P
    Sansone, SA
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 68 - 71
  • [5] ArrayExpress service for reviewers/editors of DNA microarray papers
    Brazma, Alvis
    Parkinson, Helen
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (11) : 1321 - 1322
  • [6] Genome-wide analysis of estrogen receptor binding sites
    Carroll, Jason S.
    Meyer, Clifford A.
    Song, Jun
    Li, Wei
    Geistlinger, Timothy R.
    Eeckhoute, Jerome
    Brodsky, Alexander S.
    Keeton, Erika Krasnickas
    Fertuck, Kirsten C.
    Hall, Giles F.
    Wang, Qianben
    Bekiranov, Stefan
    Sementchenko, Victor
    Fox, Edward A.
    Silver, Pamela A.
    Gingeras, Thomas R.
    Liu, X. Shirley
    Brown, Myles
    [J]. NATURE GENETICS, 2006, 38 (11) : 1289 - 1297
  • [7] Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data
    Chen, James J.
    Hsueh, Huey-Miin
    Delongchamp, Robert R.
    Lin, Chien-Ju
    Tsai, Chen-An
    [J]. BMC BIOINFORMATICS, 2007, 8 (1) : 1 - 14
  • [8] The transcriptional consequences of mutation and natural selection in Caenorhabditis elegans
    Denver, DR
    Morris, K
    Streelman, JT
    Kim, SK
    Lynch, M
    Thomas, WK
    [J]. NATURE GENETICS, 2005, 37 (05) : 544 - 548
  • [9] Molecular analysis of flies selected for aggressive behavior
    Dierick, Herman A.
    Greenspan, Ralph J.
    [J]. NATURE GENETICS, 2006, 38 (09) : 1023 - 1031
  • [10] Reliability and reproducibility issues in DNA microarray measurements
    Draghici, S
    Khatri, P
    Eklund, AC
    Szallasi, Z
    [J]. TRENDS IN GENETICS, 2006, 22 (02) : 101 - 109