Comparative Genomics Approaches Accurately Predict Deleterious Variants in Plants

被引:32
作者
Kono, Thomas J. Y. [1 ]
Lei, Li [1 ]
Shih, Ching-Hua [2 ]
Hoffman, Paul J. [1 ]
Morrell, Peter L. [1 ]
Fay, Justin C. [2 ,3 ]
机构
[1] Univ Minnesota, Dept Agron & Plant Genet, St Paul, MN 55108 USA
[2] Washington Univ, Dept Genet, St Louis, MO 63110 USA
[3] Univ Rochester, Dept Biol, 319 Hutchison Hall,RC Box 270211, Rochester, NY 14627 USA
基金
美国国家科学基金会;
关键词
deleterious mutations; phenotypes; genome; training set; HUMAN-DISEASE MUTATIONS; ARABIDOPSIS-THALIANA; NONSYNONYMOUS SNVS; SEQUENCE ALIGNMENT; GENETIC-VARIATION; DOMESTICATION; SUBSTITUTIONS; CONSEQUENCES; ACCUMULATION; IDENTIFICATION;
D O I
10.1534/g3.118.200563
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Recent advances in genome resequencing have led to increased interest in prediction of the functional consequences of genetic variants. Variants at phylogenetically conserved sites are of particular interest, because they are more likely than variants at phylogenetically variable sites to have deleterious effects on fitness and contribute to phenotypic variation. Numerous comparative genomic approaches have been developed to predict deleterious variants, but the approaches are nearly always assessed based on their ability to identify known disease-causing mutations in humans. Determining the accuracy of deleterious variant predictions in nonhuman species is important to understanding evolution, domestication, and potentially to improving crop quality and yield. To examine our ability to predict deleterious variants in plants we generated a curated database of 2,910 Arabidopsis thaliana mutants with known phenotypes. We evaluated seven approaches and found that while all performed well, their relative ranking differed from prior benchmarks in humans. We conclude that deleterious mutations can be reliably predicted in A. thaliana and likely other plant species, but that the relative performance of various approaches does not necessarily translate from one species to another.
引用
收藏
页码:3321 / 3329
页数:9
相关论文
共 76 条
[1]  
Adzhubei Ivan, 2013, Curr Protoc Hum Genet, VChapter 7, DOI 10.1002/0471142905.hg0720s76
[2]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[3]   Medical sequencing at the extremes of human body mass [J].
Ahituv, Nadav ;
Kavaslar, Nihan ;
Schackwitz, Wendy ;
Ustaszewska, Anna ;
Martin, Joel ;
Hebert, Sybil ;
Doelle, Heather ;
Ersoy, Baran ;
Kryukov, Gregory ;
Schmidt, Steffen ;
Yosef, Nir ;
Ruppin, Eytan ;
Sharan, Roded ;
Vaisse, Christian ;
Sunyaev, Shamil ;
Dent, Robert ;
Cohen, Jonathan ;
McPherson, Ruth ;
Pennacchio, Len A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (04) :779-791
[4]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[6]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[7]  
Boutet E, 2016, METHODS MOL BIOL, V1374, P23, DOI 10.1007/978-1-4939-3167-5_2
[8]   Assessing the evolutionary impact of amino acid mutations in the human genome [J].
Boyko, Adam R. ;
Williamson, Scott H. ;
Indap, Amit R. ;
Degenhardt, Jeremiah D. ;
Hernandez, Ryan D. ;
Lohmueller, Kirk E. ;
Adams, Mark D. ;
Schmidt, Steffen ;
Sninsky, John J. ;
Sunyaev, Shamil R. ;
White, Thomas J. ;
Nielsen, Rasmus ;
Clark, Andrew G. ;
Bustamante, Carlos D. .
PLOS GENETICS, 2008, 4 (05)
[9]   Epistasis as the primary factor in molecular evolution [J].
Breen, Michael S. ;
Kemena, Carsten ;
Vlasov, Peter K. ;
Notredame, Cedric ;
Kondrashov, Fyodor A. .
NATURE, 2012, 490 (7421) :535-+
[10]   Whole-genome sequencing of multiple Arabidopsis thaliana populations [J].
Cao, Jun ;
Schneeberger, Korbinian ;
Ossowski, Stephan ;
Guenther, Torsten ;
Bender, Sebastian ;
Fitz, Joffrey ;
Koenig, Daniel ;
Lanz, Christa ;
Stegle, Oliver ;
Lippert, Christoph ;
Wang, Xi ;
Ott, Felix ;
Mueller, Jonas ;
Alonso-Blanco, Carlos ;
Borgwardt, Karsten ;
Schmid, Karl J. ;
Weigel, Detlef .
NATURE GENETICS, 2011, 43 (10) :956-U60