IPez: An expert system for the taxonomic identification of fishes based on machine learning techniques

被引:33
作者
Guisande, C. [1 ]
Manjarres-Hernandez, A. [2 ]
Pelayo-Villamil, P. [3 ]
Granado-Lorencio, C. [4 ]
Riveiro, I. [1 ]
Acuna, A. [1 ]
Prieto-Piraquive, E. [4 ]
Janeiro, E. [5 ]
Matias, J. M. [5 ]
Patti, C. [6 ]
Patti, B. [6 ]
Mazzola, S. [6 ]
Jimenez, S. [7 ]
Duque, V. [7 ]
Salmeron, F. [8 ]
机构
[1] Univ Vigo, Fac Ciencias, Vigo 36200, Spain
[2] Univ Nacl Colombia, Inst Amazon Invest IMANI, Leticia, Colombia
[3] Univ Antioquia, Grp Ictiol, Medellin 1226, Colombia
[4] Univ Seville, Fac Biol, E-41012 Seville, Spain
[5] Univ Vigo, Dept Ingn Recursos Nat & Medio Ambiente, Vigo 36310, Spain
[6] CNR, Ist Ambiente Marino Costiero, I-91026 Mazara Del Vallo, TP, Italy
[7] Ctr Oceanog Canarias, Inst Espanol Oceanog, Tenerife 38120, Spain
[8] Ctr Oceanog Malaga, Inst Espanol Oceanog, Fuengirola 29640, Spain
关键词
Classification and regression trees; Morphology; Taxonomy; Identification; Fishes;
D O I
10.1016/j.fishres.2009.12.003
中图分类号
S9 [水产、渔业];
学科分类号
0908 ;
摘要
The taxonomic identification of fishes is a time-consuming process for those who are not specialists and, therefore, are more liable to make mistakes. We measured morphometric characters in more than 8900 individuals belonging to 6 classes, 43 orders, 192 families, 510 genera and 847 marine and freshwater species. The aim was to determine if the taxonomic identification of juvenile and adult fishes is possible using these measurements. We developed the expert system IPez, which is based on machine learning techniques, and found that, when the number of individuals measured of a species and, hence, included in the database of IPez, is higher than approximately 15 individuals, IPez identifies correctly 100% of new individuals of this species that are not included into the database. Moreover, besides helping in the taxonomic identification of fish, this software allows the determining of the main morphometric characters that have promoted or are promoting divergence among closely related species. The software is free and available at the web page http://www.ipez.es/index%20ingles.html. We suggest increased international collaboration to introduce more species into IPez. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:240 / 247
页数:8
相关论文
共 18 条
[11]   A machine learning methodology for the analysis of workplace accidents [J].
Matias, J. M. ;
Rivas, T. ;
Martin, J. E. ;
Taboada, J. .
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2008, 85 (3-4) :559-578
[12]   Machine learning techniques applied to the determination of road suitability for the transportation of dangerous substances [J].
Matias, J. M. ;
Taboadab, J. ;
Ordonez, C. ;
Nieto, P. G. .
JOURNAL OF HAZARDOUS MATERIALS, 2007, 147 (1-2) :60-66
[13]   GENERALIZED LINEAR MODELS [J].
NELDER, JA ;
WEDDERBURN, RW .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-GENERAL, 1972, 135 (03) :370-+
[14]  
ROHLF FJ, 1990, ANNU REV ECOL SYST, V21, P299, DOI 10.1146/annurev.ecolsys.21.1.299
[15]   A REVOLUTION IN MORPHOMETRICS [J].
ROHLF, FJ ;
MARCUS, LF .
TRENDS IN ECOLOGY & EVOLUTION, 1993, 8 (04) :129-132
[16]  
Segal M.R., 2004, Machine learning benchmarks and random forest regression
[17]  
Venables W.N, 2002, Modern Applied Statistics with S. Fourth, DOI 10.1007/978-0-387-21706-2
[18]   Functional versus morphological diversity in macroevolution [J].
Wainwright, Peter C. .
ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS, 2007, 38 :381-401