Empirical comparison of cross-platform normalization methods for gene expression data

Cited by: 71
Authors
Rudy, Jason [1]
Valafar, Faramarz [1]
Affiliations
[1] San Diego State Univ, Biomed Informat Res Ctr, San Diego, CA 92182 USA
Source
BMC BIOINFORMATICS | 2011 / Vol. 12
Keywords
MICROARRAY DATA; METAANALYSIS; CANCER; AFFYMETRIX; REPRODUCIBILITY; CONCORDANCE; VALIDATION; PROFILES; SETS;
DOI
10.1186/1471-2105-12-467
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Simultaneous measurement of gene expression on a genomic scale can be accomplished using microarray technology or by sequencing-based methods. Researchers who perform high-throughput gene expression assays often deposit their data in public databases, but heterogeneity of measurement platforms leads to challenges for the combination and comparison of data sets. Researchers wishing to perform cross-platform normalization face two major obstacles. First, a choice must be made about which method or methods to employ. Nine are currently available, and no rigorous comparison exists. Second, software for the selected method must be obtained and incorporated into a data analysis workflow.
Results: Using two publicly available cross-platform testing data sets, cross-platform normalization methods are compared based on inter-platform concordance and on the consistency of gene lists obtained with transformed data. Scatter and ROC-like plots are produced and new statistics based on those plots are introduced to measure the effectiveness of each method. Bootstrapping is employed to obtain distributions for those statistics. The consistency of platform effects across studies is explored theoretically and with respect to the testing data sets.
Conclusions: Our comparisons indicate that four methods, DWD, EB, GQ, and XPN, are generally effective, while the remaining methods do not adequately correct for platform effects. Of the four successful methods, XPN generally shows the highest inter-platform concordance when treatment groups are equally sized, while DWD is most robust to differently sized treatment groups and consistently shows the smallest loss in gene detection. We provide an R package, CONOR, capable of performing the nine cross-platform normalization methods considered. The package can be downloaded at http://alborz.sdsu.edu/conor and is available from CRAN.
Pages: 22
Related papers
50 in total
  • [1] Empirical comparison of cross-platform normalization methods for gene expression data
    Jason Rudy
    Faramarz Valafar
    BMC Bioinformatics, 12
  • [2] MatchMixeR: a cross-platform normalization method for gene expression data integration
    Zhang, Serin
    Shao, Jiang
    Yu, Disa
    Qiu, Xing
    Zhang, Jinfeng
    BIOINFORMATICS, 2020, 36 (08) : 2486 - 2491
  • [3] CuBlock: a cross-platform normalization method for gene-expression microarrays
    Junet, Valentin
    Farres, Judith
    Mas, Jose M.
    Daura, Xavier
    BIOINFORMATICS, 2021, 37 (16) : 2365 - 2373
  • [4] Cross-Platform Analysis with Binarized Gene Expression Data
    Tuna, Salih
    Niranjan, Mahesan
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2009, 5780 : 439 - 449
  • [5] Merging two gene-expression studies via cross-platform normalization
    Shabalin, Andrey A.
    Tjelmeland, Hakon
    Fan, Cheng
    Perou, Charles M.
    Nobel, Andrew B.
    BIOINFORMATICS, 2008, 24 (09) : 1154 - 1160
  • [6] PLIDA: cross-platform gene expression normalization using perturbed topic models
    Deshwar, Amit G.
    Morris, Quaid
    BIOINFORMATICS, 2014, 30 (07) : 956 - 961
  • [7] Molecular classification of lung cancer - A cross-platform comparison of gene expression data sets
    Parmigiani, G
    Garrett, E
    Anbazhagan, B
    Gabrielson, E
    CHEST, 2004, 125 (05) : 103S - 103S
  • [8] Feature specific quantile normalization enables cross-platform classification of molecular subtypes using gene expression data
    Franks, Jennifer M.
    Cai, Guoshuai
    Whitfield, Michael L.
    BIOINFORMATICS, 2018, 34 (11) : 1868 - 1874
  • [9] Cross-platform comparison and visualisation of gene expression data using co-inertia analysis
    Aedín C Culhane
    Guy Perrière
    Desmond G Higgins
    BMC Bioinformatics, 4
  • [10] Differential network analysis from cross-platform gene expression data
    Zhang, Xiao-Fei
    Ou-Yang, Le
    Zhao, Xing-Ming
    Yan, Hong
    SCIENTIFIC REPORTS, 2016, 6