Supervised pre-processing approaches in multiple class variables classification for fish recruitment forecasting

被引:29
作者
Fernandes, Jose A. [1 ,2 ]
Lozano, Jose A. [2 ]
Inza, Inaki [2 ]
Irigoien, Xabier [1 ,3 ]
Perez, Aritz [2 ]
Rodriguez, Juan D. [2 ]
机构
[1] AZTI Tecnalia, Div Marine Res, E-20110 Pasaia, Gipuzkoa, Spain
[2] Univ Basque Country, Dept Comp Sci & AI, Intelligent Syst Grp ISG, E-20018 Donostia San Sebastian, Spain
[3] KAUST, Red Sea Res Ctr, Thuwal 239556900, Saudi Arabia
关键词
Supervised classification; Multi-dimensional classification; Bayesian networks; Missing imputation; Discretization; Feature subset selection; Environmental modelling; Recruitment forecasting; BAYESIAN NETWORKS; STATISTICAL COMPARISONS; ECOSYSTEM APPROACH; MANAGEMENT; MODELS; CLASSIFIERS; ALGORITHMS; SELECTION; PREDATION; FISHERIES;
D O I
10.1016/j.envsoft.2012.10.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A multi-species approach to fisheries management requires taking into account the interactions between species in order to improve recruitment forecasting of the fish species. Recent advances in Bayesian networks direct the learning of models with several interrelated variables to be forecasted simultaneously. These models are known as multi-dimensional Bayesian network classifiers (MDBNs). Preprocessing steps are critical for the posterior learning of the model in these kinds of domains. Therefore, in the present study, a set of 'state-of-the-art' uni-dimensional pre-processing methods, within the categories of missing data imputation, feature discretization and feature subset selection, are adapted to be used with MDBNs. A framework that includes the proposed multi-dimensional supervised preprocessing methods, coupled with a MDBN classifier, is tested with synthetic datasets and the real domain of fish recruitment forecasting. The correctly forecasting of three fish species (anchovy, sardine and hake) simultaneously is doubled (from 173% to 29.5%) using the multi-dimensional approach in comparison to mono-species models. The probability assessments also show high improvement reducing the average error (estimated by means of Brier score) from 0.35 to 0.27. Finally, these differences are superior to the forecasting of species by pairs. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:245 / 254
页数:10
相关论文
共 89 条
  • [1] Bayesian networks in environmental modelling
    Aguilera, P. A.
    Fernandez, A.
    Fernandez, R.
    Rumi, R.
    Salmeron, A.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2011, 26 (12) : 1376 - 1388
  • [2] Hybrid Bayesian network classifiers: Application to species distribution models
    Aguilera, P. A.
    Fernandez, A.
    Reche, F.
    Rumi, R.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (12) : 1630 - 1639
  • [3] On learning algorithm selection for classification
    Ali, S
    Smith, KA
    [J]. APPLIED SOFT COMPUTING, 2006, 6 (02) : 119 - 138
  • [4] The potential use of a Gadget model to predict stock responses to climate change in combination with Bayesian networks: the case of Bay of Biscay anchovy
    Andonegi, Eider
    Antonio Fernandes, Jose
    Quincoces, Inaki
    Irigoien, Xabier
    Uriarte, Andres
    Perez, Aritz
    Howel, Daniel
    Stefanssons, Gunnar
    [J]. ICES JOURNAL OF MARINE SCIENCE, 2011, 68 (06) : 1257 - 1269
  • [5] [Anonymous], P IEEE INT C FUZZ SY
  • [6] [Anonymous], ICES BENCHM WORKSH S
  • [7] [Anonymous], 2005, Proceedings of the 22nd international conference on Machine learning-ICML'05, DOI [10.1145/1102351.1102373, DOI 10.1145/1102351.1102373]
  • [8] [Anonymous], 24 INT FLAIRS C
  • [9] [Anonymous], 2006, COMPSCI
  • [10] [Anonymous], INT SCH SYNTH EXP KN