Systematic approaches for incorporating control spots and data quality information to improve normalization of cDNA microarray data

Cited by: 1
Authors
Wang, D.
Zhang, C.-H.
Soares, M. B.
Huang, J.
Affiliations
[1] Univ Alabama Birmingham, Ctr Comprehens Canc, Biostat & Bioinformat Unit, Birmingham, AL 35294 USA
[2] Rutgers State Univ, Dept Stat, Piscataway, NJ USA
[3] Northwestern Univ, Dept Biochem Mol Biol & Cell Biol, Chicago, IL USA
[4] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA USA
Keywords
microarray; normalization; quality control; spike; two-way semi-linear model
DOI
10.1080/10543400701199544
Chinese Library Classification number
R9 [Pharmacy]
Discipline classification code
1007
Abstract
Background: Normalization and data quality control are two important aspects of microarray data analysis. Proper normalization and data quality control ensure that intensity ratios provide meaningful and accurate measurements of relative gene expression values. Control spots, such as spikes and housekeeping genes with known concentrations in two channels, are often used for calibrating experimental parameters. They provide valuable information about experimental variation that can be utilized for better normalization. They are also needed for proper normalization in cases where most of the spots tend to change in one direction. In addition, it is desirable to include information on spot quality. Such information is available in a typical microarray data set but is not fully utilized by existing normalization methods. Results: We propose two extensions of the two-way semi-linear model (TW-SLM) for appropriately combining control genes and spot quality information in normalization. The first extension (TW-SLMC) is designed to systematically incorporate control spots in a semi-parametric model to calibrate estimated normalization curves so that the relative fold changes of gene expression are accurately estimated. Extrapolation is not required in this approach. The second extension (TW-SLMQ) incorporates spot quality measures into normalization by down-weighting spots with lower quality scores. These two extensions can be used simultaneously for normalizing a data set. Two microarray data sets are used to demonstrate the proposed methods. Availability: An R-based computing package has been developed for the proposed methods and is available from the corresponding authors.
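The idea of down-weighting low-quality spots during normalization can be illustrated with a small sketch. This is not the TW-SLMQ method or the authors' R package: it swaps in a much simpler technique, a quality-weighted polynomial fit of the log-ratio M on the average log-intensity A, and the function name `weighted_curve_normalize`, the quadratic curve, and the simulated data are all assumptions made for illustration.

```python
import numpy as np


def weighted_curve_normalize(A, M, quality, degree=2):
    """Subtract a quality-weighted polynomial trend of M on A.

    Hypothetical helper (not from the paper): A is the average log-intensity,
    M the log-ratio, and quality a non-negative score per spot.
    """
    A = np.asarray(A, dtype=float)
    M = np.asarray(M, dtype=float)
    w = np.asarray(quality, dtype=float)
    # np.polyfit applies its weights to the unsquared residuals, so passing
    # sqrt(w) weights each squared residual by w.
    coeffs = np.polyfit(A, M, deg=degree, w=np.sqrt(w))
    trend = np.polyval(coeffs, A)
    return M - trend, coeffs


# Simulated array: an intensity-dependent dye bias and no true differential
# expression, so a well-normalized M should be zero on good spots.
A = np.linspace(6.0, 14.0, 200)
bias = 0.1 * (A - 10.0) ** 2 - 0.3
M = bias.copy()

# Every tenth spot is corrupted and flagged with quality score 0.
bad = np.zeros(A.size, dtype=bool)
bad[::10] = True
M[bad] += 3.0
quality = np.where(bad, 0.0, 1.0)

M_weighted, _ = weighted_curve_normalize(A, M, quality)
M_unweighted, _ = weighted_curve_normalize(A, M, np.ones_like(A))

print("max |M| on good spots, weighted:  ", np.abs(M_weighted[~bad]).max())
print("max |M| on good spots, unweighted:", np.abs(M_unweighted[~bad]).max())
```

In this toy setting the weighted fit ignores the flagged spots and recovers the bias curve from the good ones, while the unweighted fit is pulled toward the corrupted spots and leaves a residual shift in the good ones.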
Pages: 415-431
Number of pages: 17
Related papers
34 items in total
  • [21] Identifying mislabeled and contaminated DNA methylation microarray data: an extended quality control toolset with examples from GEO
    Heiss, Jonathan A.
    Just, Allan C.
    CLINICAL EPIGENETICS, 2018, 10
  • [22] AIAP: A Quality Control and Integrative Analysis Package to Improve ATAC-seq Data Analysis
    Liu, Shaopeng
    Li, Daofeng
    Lyu, Cheng
    Gontarz, Paul M. M.
    Miao, Benpeng
    Madden, Pamela A. F.
    Wang, Ting
    Zhang, Bo
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2021, 19 (04) : 641 - 651
  • [23] Automated processing of subcontractor work performance data to improve the quality control and support the subcontractor selection process
    Vilutiene, Tatjana
    25TH INTERNATIONAL SYMPOSIUM ON AUTOMATION AND ROBOTICS IN CONSTRUCTION - ISARC-2008, 2008, : 507 - +
  • [24] Three-color cDNA microarrays with prehybridization quality control yield gene expression data comparable to that of commercial platforms
    Hessner, MJ
    Xiang, BX
    Jia, SA
    Geoffrey, R
    Holmes, S
    Meyer, L
    Muheisen, S
    Wang, XJ
    PHYSIOLOGICAL GENOMICS, 2006, 25 (01) : 166 - 178
  • [25] Employing quality control and feedback to the EQ-5D-5L valuation protocol to improve the quality of data collection
    Purba, Fredrick Dermawan
    Hunfeld, Joke A. M.
    Iskandarsyah, Aulia
    Fitriana, Titi Sahidah
    Sadarjoen, Sawitri S.
    Passchier, Jan
    Busschbach, Jan J. V.
    QUALITY OF LIFE RESEARCH, 2017, 26 (05) : 1197 - 1208
  • [26] Employing quality control and feedback to the EQ-5D-5L valuation protocol to improve the quality of data collection
    Fredrick Dermawan Purba
    Joke A. M. Hunfeld
    Aulia Iskandarsyah
    Titi Sahidah Fitriana
    Sawitri S. Sadarjoen
    Jan Passchier
    Jan J. V. Busschbach
    Quality of Life Research, 2017, 26 : 1197 - 1208
  • [27] Data quality control with multi-source information for FY-3 microwave sounder observations
    Li, Xiaoqing
    Wu Chunqiang
    Lu Qifeng
    Hui, Liu
    Liu Ruixia
    REMOTE SENSING OF CLOUDS AND THE ATMOSPHERE XXIV, 2019, 11152
  • [28] Proposal of geographic information systems methodology for quality control procedures of data obtained in naturalistic driving studies
    Balsa-Barreiro, Jose
    Valero-Mora, Pedro M.
    Pareja-Montoro, Ignacio
    Sanchez-Garcia, Mar
    IET INTELLIGENT TRANSPORT SYSTEMS, 2015, 9 (07) : 673 - 682
  • [29] HTS quality control and data analysis: A process to maximize information from a high-throughput screen
    Padmanabha, R
    Cook, L
    Gill, J
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2005, 8 (06) : 521 - 527
  • [30] Quality Control of Soil Water Data in Applied Climate Information System-Case Study in Nebraska
    You, Jinshing
    Hubbard, Kenneth G.
    Mahmood, Rezaul
    Sridhar, Venkataramana
    Todey, Dennis
    JOURNAL OF HYDROLOGIC ENGINEERING, 2010, 15 (03) : 200 - 209