A Critical Assessment of Storytelling: Gene Ontology Categories and the Importance of Validating Genomic Scans

Cited by: 168
Authors
Pavlidis, Pavlos [1]
Jensen, Jeffrey D. [2]
Stephan, Wolfgang [3]
Stamatakis, Alexandros [1]
Affiliations
[1] Heidelberg Inst Theoret Studies HITS gGmbH, Sci Comp Grp, Exelixis Lab, Heidelberg, Germany
[2] Ecole Polytech Fed Lausanne, Sch Life Sci, Lausanne, Switzerland
[3] Univ Munich, Sect Evolutionary Biol, Bioctr, Planegg Martinsried, Germany
Funding
US National Science Foundation;
Keywords
genome scanning; positive selection; gene ontology; validation; literature mining; STRONG POSITIVE SELECTION; DROSOPHILA-MELANOGASTER; NATURAL-SELECTION; PATTERNS; DIFFERENTIATION; HITCHHIKING; POPULATION; SIGNATURES; SWEEPS; LOCI;
DOI
10.1093/molbev/mss136
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
In the age of whole-genome population genetics, so-called genomic scan studies often conclude with a long list of putatively selected loci. These lists are then further scrutinized to annotate these regions by gene function, corresponding biological processes, expression levels, or gene networks. Such annotations are often used to assess and/or verify the validity of the genome scan and the statistical methods that have been used to perform the analyses. Furthermore, these results are frequently considered to validate "true-positives" if the identified regions make biological sense a posteriori. Here, we show that this approach can be potentially misleading. By simulating neutral evolutionary histories, we demonstrate that it is possible not only to obtain an extremely high false-positive rate but also to make biological sense out of the false-positives and construct a sensible biological narrative. Results are compared with a recent polymorphism data set from Drosophila melanogaster.
Pages: 3237-3248
Page count: 12