Superiority of artificial neural networks for a genetic classification procedure

被引：23

作者：

Sant'Anna, I. C. ^{[1
]}

Tomaz, R. S. ^{[3
]}

Silva, G. N. ^{[2
,4
]}

Nascimento, M. ^{[2
,4
]}

Bhering, L. L. ^{[1
]}

Cruz, C. D. ^{[1
,2
,4
]}

机构：

[1] Univ Fed Vicosa, Programa Posgrad Genet & Melhoramento, Vicosa, MG, Brazil

[2] Univ Fed Vicosa, Programa Posgrad Estat Aplicada & Biometr, Vicosa, MG, Brazil

[3] Univ Estadual Paulista, Dracena, SP, Brazil

[4] Univ Fed Vicosa, Lab Bioinformat, Vicosa, MG, Brazil

来源：

GENETICS AND MOLECULAR RESEARCH | 2015年 / 14卷 / 03期

关键词：

Artificial Intelligence; Discrimination; Similarity; Statistics;

D O I：

10.4238/2015.August.19.24

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

The correct classification of individuals is extremely important for the preservation of genetic variability and for maximization of yield in breeding programs using phenotypic traits and genetic markers. The Fisher and Anderson discriminant functions are commonly used multivariate statistical techniques for these situations, which allow for the allocation of an initially unknown individual to predefined groups. However, for higher levels of similarity, such as those found in backcrossed populations, these methods have proven to be inefficient. Recently, much research has been devoted to developing a new paradigm of computing known as artificial neural networks (ANNs), which can be used to solve many statistical problems, including classification problems. The aim of this study was to evaluate the feasibility of ANNs as an evaluation technique of genetic diversity by comparing their performance with that of traditional methods. The discriminant functions were equally ineffective in discriminating the populations, with error rates of 23-82%, thereby preventing the correct discrimination of individuals between populations. The ANN was effective in classifying populations with low and high differentiation, such as those derived from a genetic design established from backcrosses, even in cases of low differentiation of the data sets. The ANN appears to be a promising technique to solve classification problems, since the number of individuals classified incorrectly by the ANN was always lower than that of the discriminant functions. We envisage the potential relevant application of this improved procedure in the genomic classification of markers to distinguish between breeds and accessions.

引用

页码：9898 / 9906

页数：9

共 24 条

[1] Anderson T. W., 1958, An introduction to multivariate statistical analysis, V2
[2] [Anonymous], 2012, BIOMETRIC MODELS APP
[3] Ardo J., 1997, Canadian Journal of Remote Sensing, V23, P217, DOI DOI 10.1080/07038992.1997.10855204
[4] Artificial neural network analysis of genetic diversity in Carica papaya L.
Barbosa, Cibelle Degel
Viana, Alexandre Pio
Red Quintal, Silvana Silva
Pereira, Messias Gonzaga
[J]. CROP BREEDING AND APPLIED BIOTECHNOLOGY, 2011, 11 (03): : 224 - 231
[5] Bishop C.M., 2006, J ELECTRON IMAGING, V16, P049901
[6] BOSCHI L.S., 2004, B CIENC GEOD, V10, P193
[7] BRAGA A. P., 2011, REDES NEURAIS ARTIFI, V2th
[8] Weed-plant discrimination by machine vision and artificial neural network
Cho, SI
Lee, DS
Jeong, JY
[J]. BIOSYSTEMS ENGINEERING, 2002, 83 (03) : 275 - 280
[9] CRUZ C.D., 2014, Modelos biometricos aplicados ao melhoramento genetico, V2
[10] Cruz CD, 2011, BIOMETRIA APLICADA A

← 1 2 3 →