Genomic Prediction in Pea: Effect of Marker Density and Training Population Size and Composition on Prediction Accuracy

Cited by: 69
Authors
Tayeh, Nadim [1]
Klein, Anthony [1]
Le Paslier, Marie-Christine [2]
Jacquin, Francoise [1]
Houtin, Herve [1]
Rond, Celine [1]
Chabert-Martinello, Marianne [1]
Magnin-Robert, Jean-Bernard [1]
Marget, Pascal [1]
Aubert, Gregoire [1]
Burstin, Judith [1]
Affiliations
[1] INRA, Agroecol UMR1347, F-21034 Dijon, France
[2] CEA IG Ctr Natl Genotypage, INRA, Etud Polymorphisme Genomes Vegetaux US1279, Evry, France
Source
FRONTIERS IN PLANT SCIENCE | 2015 / Volume 6
Keywords
pea (Pisum sativum L.); GenoPea 13.2K SNP Array; genomic selection; marker density; training set; prediction accuracy; SELECTION; RELIABILITY; TRAITS; SET;
DOI
10.3389/fpls.2015.00941
Chinese Library Classification number
Q94 [Botany];
Discipline classification code
071001 ;
Abstract
Pea is an important food and feed crop and a valuable component of low-input farming systems. Improving resistance to biotic and abiotic stresses is a major breeding target to enhance yield potential and regularity. Genomic selection (GS) has lately emerged as a promising technique to increase the accuracy and gain of marker-based selection. It uses genome-wide molecular marker data to predict the breeding values of candidate lines for selection. A collection of 339 genetic resource accessions (CRB339) was subjected to high-density genotyping using the GenoPea 13.2K SNP Array. Genomic prediction accuracy was evaluated for thousand seed weight (TSW), the number of seeds per plant (NSeed), and the date of flowering (BegFlo). Mean cross-environment prediction accuracies reached 0.83 for TSW, 0.68 for NSeed, and 0.65 for BegFlo. For each trait, the statistical method, the marker density, and/or the training population size and composition used for prediction were varied to investigate their effects on prediction accuracy: the effect was large for the size and composition of the training population but limited for the statistical method and marker density. Maximizing the relatedness between individuals in the training and test sets, through the CDmean-based method, significantly improved prediction accuracies. A cross-population cross-validation experiment was further conducted using the CRB339 collection as the training set and nine recombinant inbred line populations as the test set. Prediction quality was high, with mean Q² of 0.44 for TSW and 0.59 for BegFlo. Results are discussed in the light of current efforts to develop GS strategies in pea.
Pages: 11