Detecting and overcoming systematic bias in high-throughput screening technologies: a comprehensive review of practical issues and methodological solutions

被引:31
作者
Caraus, Iurie [1 ]
Alsuwailem, Abdulaziz A. [2 ]
Nadon, Robert [2 ,3 ]
Makarenkov, Vladimir [4 ]
机构
[1] Univ Quebec, Dept Comp Sci, Montreal, PQ H3C 3P8, Canada
[2] McGill Univ, Dept Human Genet, Montreal, PQ H3A 2T5, Canada
[3] Genome Quebec Innovat Ctr, Montreal, PQ, Canada
[4] Univ Quebec, Dept Comp Sci, Grad Bioinformat Program, Montreal, PQ H3C 3P8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
data correction methods; data normalization methods; high-content screening (HCS); high-throughput screening (HTS); systematic error; RNA INTERFERENCE SCREENS; SMALL-MOLECULE LIBRARIES; STATISTICAL-METHODS; HIT SELECTION; MICROARRAY EXPERIMENTS; POPULATION CONTEXT; DESIGN; CELLS; IDENTIFICATION; COMPONENTS;
D O I
10.1093/bib/bbv004
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Significant efforts have been made recently to improve data throughput and data quality in screening technologies related to drug design. The modern pharmaceutical industry relies heavily on high-throughput screening (HTS) and high-content screening (HCS) technologies, which include small molecule, complementary DNA (cDNA) and RNA interference (RNAi) types of screening. Data generated by these screening technologies are subject to several environmental and procedural systematic biases, which introduce errors into the hit identification process. We first review systematic biases typical of HTS and HCS screens. We highlight that study design issues and the way in which data are generated are crucial for providing unbiased screening results. Considering various data sets, including the publicly available ChemBank data, we assess the rates of systematic bias in experimental HTS by using plate-specific and assay-specific error detection tests. We describe main data normalization and correction techniques and introduce a general data preprocessing protocol. This protocol can be recommended for academic and industrial researchers involved in the analysis of current or next-generation HTS data.
引用
收藏
页码:974 / 986
页数:13
相关论文
共 76 条
[1]  
Amberkar Sandeep, 2013, World J Virol, V2, P18, DOI 10.5501/wjv.v2.i2.18
[2]  
[Anonymous], 1994, Statistics
[3]  
[Anonymous], 1986, Statistics: Textbooks and Monographs, Vol. 68
[4]  
[Anonymous], 1925, STAT METHODS RES WOR
[5]   High-throughput RNA interference screens in Drosophila tissue culture cells [J].
Armknecht, S ;
Boutros, M ;
Kiger, A ;
Nybakken, K ;
Mathey-Prevot, B ;
Perrimon, N .
RNA INTERFERENCE, 2005, 392 :55-+
[6]   Statistical Design and Analysis of RNA Sequencing Data [J].
Auer, Paul L. ;
Doerge, R. W. .
GENETICS, 2010, 185 (02) :405-U32
[7]   Quantitative analysis of fitness and genetic interactions in yeast on a genome scale [J].
Baryshnikova, Anastasia ;
Costanzo, Michael ;
Kim, Yungil ;
Ding, Huiming ;
Koh, Judice ;
Toufighi, Kiana ;
Youn, Ji-Young ;
Ou, Jiongwen ;
San Luis, Bryan-Joseph ;
Bandyopadhyay, Sunayan ;
Hibbs, Matthew ;
Hess, David ;
Gingras, Anne-Claude ;
Bader, Gary D. ;
Troyanskaya, Olga G. ;
Brown, Grant W. ;
Andrews, Brenda ;
Boone, Charles ;
Myers, Chad L. .
NATURE METHODS, 2010, 7 (12) :1017-U110
[8]   Statistical methods for analysis of high-throughput RNA interference screens [J].
Birmingham, Amanda ;
Selfors, Laura M. ;
Forster, Thorsten ;
Wrobel, David ;
Kennedy, Caleb J. ;
Shanks, Emma ;
Santoyo-Lopez, Javier ;
Dunican, Dara J. ;
Long, Aideen ;
Kelleher, Dermot ;
Smith, Queta ;
Beijersbergen, Roderick L. ;
Ghazal, Peter ;
Shamu, Caroline E. .
NATURE METHODS, 2009, 6 (08) :569-575
[9]   Replication [J].
Blainey, Paul ;
Krzywinski, Martin ;
Altman, Naomi .
NATURE METHODS, 2014, 11 (09) :879-880
[10]   The art and design of genetic screens: RNA interference [J].
Boutros, Michael ;
Ahringer, Julie .
NATURE REVIEWS GENETICS, 2008, 9 (07) :554-566