Empirical comparison of cross-platform normalization methods for gene expression data

被引:71
|
作者
Rudy, Jason [1 ]
Valafar, Faramarz [1 ]
机构
[1] San Diego State Univ, Biomed Informat Res Ctr, San Diego, CA 92182 USA
来源
BMC BIOINFORMATICS | 2011年 / 12卷
关键词
MICROARRAY DATA; METAANALYSIS; CANCER; AFFYMETRIX; REPRODUCIBILITY; CONCORDANCE; VALIDATION; PROFILES; SETS;
D O I
10.1186/1471-2105-12-467
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Simultaneous measurement of gene expression on a genomic scale can be accomplished using microarray technology or by sequencing based methods. Researchers who perform high throughput gene expression assays often deposit their data in public databases, but heterogeneity of measurement platforms leads to challenges for the combination and comparison of data sets. Researchers wishing to perform cross platform normalization face two major obstacles. First, a choice must be made about which method or methods to employ. Nine are currently available, and no rigorous comparison exists. Second, software for the selected method must be obtained and incorporated into a data analysis workflow. Results: Using two publicly available cross-platform testing data sets, cross-platform normalization methods are compared based on inter-platform concordance and on the consistency of gene lists obtained with transformed data. Scatter and ROC-like plots are produced and new statistics based on those plots are introduced to measure the effectiveness of each method. Bootstrapping is employed to obtain distributions for those statistics. The consistency of platform effects across studies is explored theoretically and with respect to the testing data sets. Conclusions: Our comparisons indicate that four methods, DWD, EB, GQ, and XPN, are generally effective, while the remaining methods do not adequately correct for platform effects. Of the four successful methods, XPN generally shows the highest inter-platform concordance when treatment groups are equally sized, while DWD is most robust to differently sized treatment groups and consistently shows the smallest loss in gene detection. We provide an R package, CONOR, capable of performing the nine cross-platform normalization methods considered. The package can be downloaded at http://alborz.sdsu.edu/conor and is available from CRAN.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] MAGIC: access portal to a cross-platform gene expression compendium for maize
    Fu, Qiang
    Fierro, Ana Carolina
    Meysman, Pieter
    Sanchez-Rodriguez, Aminael
    Vandepoele, Klaas
    Marchal, Kathleen
    Engelen, Kristof
    BIOINFORMATICS, 2014, 30 (09) : 1316 - 1318
  • [22] Optimizing Cross-Platform Data Movement
    Kruse, Sebastian
    Kaoudi, Zoi
    Quiane-Ruiz, Jorge-Arnulfo
    Chawla, Sanjay
    Naumann, Felix
    Contreras-Rojas, Bertty
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1642 - 1645
  • [23] CrossICC: iterative consensus clustering of cross-platform gene expression data without adjusting batch effect
    Zhao, Qi
    Sun, Yu
    Liu, Zekun
    Zhang, Hongwan
    Li, Xingyang
    Zhu, Kaiyu
    Liu, Ze-Xian
    Ren, Jian
    Zuo, Zhixiang
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1818 - 1824
  • [24] Cross-platform comparison of microarray data using order restricted inference
    Klinglmueller, Florian
    Tuechler, Thomas
    Posch, Martin
    BIOINFORMATICS, 2011, 27 (07) : 953 - 960
  • [25] Cross-platform Data Analysis Reveals a Generic Gene Expression Signature for Microsatellite Instability in Colorectal Cancer
    Pacinkova, Anna
    Popovici, Vlad
    BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [26] An Empirical Study of Cross-Platform Mobile Development in Industry
    Biorn-Hansen, Andreas
    Gronli, Tor-Morten
    Ghinea, Gheorghita
    Alouneh, Sahel
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2019,
  • [27] Methods of Cross-Platform Development Mobile Applications
    Ptitsyn, Pavel Sergeyevich
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2015, 6 (01): : 1803 - 1814
  • [28] Comparison of Data Discretization Methods for Cross Platform Transfer of Gene-expression based Tumor Subtyping Classifier
    Jung, Segun
    Bi, Yingtao
    Davuluri, Ramana V.
    2014 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2014,
  • [29] Sparse canonical methods for biological data integration: application to a cross-platform study
    Kim-Anh Lê Cao
    Pascal GP Martin
    Christèle Robert-Granié
    Philippe Besse
    BMC Bioinformatics, 10
  • [30] Comparison of normalization and models for the analysis of gene expression data
    Rodriguez-Zas, S. L.
    Band, M. R.
    Everts, R. E.
    Southey, B. R.
    Liu, Z. L.
    Lewin, H. A.
    JOURNAL OF DAIRY SCIENCE, 2004, 87 : 377 - 377