Evaluation of Five Methods for Genome-Wide Circadian Gene Identification

Cited by: 32
Authors
Wu, Gang [1]
Zhu, Jiang [2, 3]
Yu, Jun [1]
Zhou, Lan [4]
Huang, Jianhua Z. [4]
Zhang, Zhang [1]
Affiliations
[1] Chinese Acad Sci, Beijing Inst Genom, CAS Key Lab Genome Sci & Informat, Beijing 100101, Peoples R China
[2] Massachusetts Gen Hosp, Dept Pathol, Boston, MA 02114 USA
[3] Harvard Univ, Sch Med, Boston, MA USA
[4] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
Keywords
circadian rhythms; comparison; circadian gene; ARSER; COSOPT; Fisher's G test; HAYSTACK; JTK_CYCLE; KEY PATHWAYS; EXPRESSION; TIME; RHYTHMS; TRANSCRIPTION; CLOCK; ALGORITHMS
DOI
10.1177/0748730414537788
Chinese Library Classification
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Identifying circadian-regulated genes from temporal transcriptome data is important for studying the regulatory mechanisms of the circadian system. However, computational methods that adopt different strategies for identifying cycling transcripts often yield inconsistent results even on the same dataset, making it challenging to choose the optimal method for a specific circadian study. To address this challenge, we evaluate 5 popular methods, ARSER (ARS), COSOPT (COS), Fisher's G test (FIS), HAYSTACK (HAY), and JTK_CYCLE (JTK), on both simulated and empirical datasets. Our results show that increasing the total number of samples (by raising the sampling frequency or lengthening the sampling time window) helps computational methods identify circadian transcripts and measure circadian phase accurately. For a given total number of samples, a higher sampling frequency is more important for HAY and JTK, whereas a longer sampling time window is more crucial for ARS and COS, as shown on simulated and empirical datasets from which circadian signals are computationally identified. In addition, the same preference for a higher sampling frequency or a longer sampling time window is also evident for JTK, ARS, and COS when estimating the circadian phases of simulated periodic profiles. Our results also indicate that attention should be paid to the significance threshold used for each method when selecting circadian genes, especially when the same empirical dataset is analyzed with 2 or more methods. In summary, for any study involving genome-wide identification of circadian genes from transcriptome data, our evaluation results provide suggestions for selecting an optimal method based on the specific goal and experimental design.
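As an illustration of one of the five methods named in the abstract, the following is a minimal sketch of the textbook form of Fisher's G test for a single evenly sampled expression profile. It is not the implementation evaluated in the paper, whose details the abstract does not give; the function name `fisher_g_test` and its return values are assumptions made for this example.

```python
import math

import numpy as np


def fisher_g_test(values):
    """Textbook Fisher's G test on one evenly sampled time series.

    Hypothetical helper for illustration only. Returns (g, p_value,
    peak_period), where peak_period is measured in sampling intervals.
    """
    x = np.asarray(values, dtype=float)
    n = x.size
    x = x - x.mean()  # remove the mean so the zero frequency carries no power

    # Periodogram ordinates at Fourier frequencies k/n for k = 1..m,
    # excluding frequency zero and (for even n) the Nyquist frequency.
    m = (n - 1) // 2
    spectrum = np.abs(np.fft.rfft(x)) ** 2
    ordinates = spectrum[1 : m + 1]

    total = ordinates.sum()
    if m < 1 or total == 0.0:
        return 0.0, 1.0, float("nan")  # constant or too-short series

    # g is the share of total power held by the dominant frequency.
    g = ordinates.max() / total
    k_peak = int(ordinates.argmax()) + 1

    # Exact null distribution of g under Gaussian white noise:
    # P(G > g) = sum_j (-1)^(j-1) * C(m, j) * (1 - j*g)^(m-1), j = 1..floor(1/g)
    upper = min(m, int(math.floor(1.0 / g)))
    p = 0.0
    for j in range(1, upper + 1):
        base = max(0.0, 1.0 - j * g)  # guard against tiny negative round-off
        p += (-1) ** (j - 1) * math.comb(m, j) * base ** (m - 1)
    p = min(max(p, 0.0), 1.0)

    return g, p, n / k_peak
```

With a profile sampled every 2 h over 48 h (24 points), a returned peak period of 12 sampling intervals corresponds to a 24 h rhythm. Because the test only examines Fourier frequencies k/n, the periods it can resolve depend on the length of the sampling window, which is one way to see why the trade-off between sampling frequency and window length discussed in the abstract matters.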
Pages: 231-242
Page count: 12