Improving machine learning predictions to estimate fishing effort using vessel's tracking data

Cited by: 1
Authors
Samarao, J. [1, 4]
Moreno, A. [1]
Gaspar, M. B. [1, 3]
Rufino, M. M. [1, 2]
Affiliations
[1] Portuguese Inst Sea & Atmosphere IPMA, Av Dr Alfredo Magalhaes Ramalho 6, P-149565 Lisbon, Portugal
[2] Univ Lisbon, Fac Sci, Ctr Stat & its Applicat CEAUL, Lisbon, Portugal
[3] CCMAR, Faro, Portugal
[4] Nova Sch Sci & Technol FCT, Almada, Portugal
Keywords
Fishing effort; Machine learning; Spatio-temporal high-resolution data; Small scale fisheries; FISHERY
DOI
10.1016/j.ecoinf.2024.102953
Chinese Library Classification number
Q14 [Ecology (bioecology)]
Discipline classification code
071012; 0713
Abstract
Small-Scale Fisheries (SSF) comprise over 80 % of the global fleet and serve as the primary income source for numerous coastal communities. However, these critical fisheries face various threats. Effective monitoring of SSF activities and their ecological impacts requires precise estimation of fishing effort from high-resolution spatio-temporal data. This information can identify areas with high fishing density, warranting protection of the main fishing grounds against other users (i.e. ocean grabbing), while also signalling potential stock depletion that requires management intervention and helping to preserve the ecosystems on which these fisheries depend. In this study, we propose a series of steps to enhance the performance of Machine Learning (ML) algorithms in estimating fishing effort. We assessed seven supervised ML algorithms, including Logistic Regression, Ridge Classifier, Random Forest Classifier, K-Neighbours, Gradient Boosting Classifier, LinearSVC, Recurrent Neural Networks and XGBoost, using four case studies from bivalve dredge and octopus pots and traps fisheries. First, based on a preliminary statistical analysis of common error measures derived from the confusion matrix, accuracy, precision, and sensitivity were chosen as evaluation criteria. We found that a simple moving average applied to speed, employed as a pre-processing technique using ten neighbouring points, improved results by up to 3 %. Random Forest and XGBoost gave the best performances among the models compared (18 % change), using the variables Latitude, Longitude, Speed, Time, and Month (accuracies near 99 %) (61 % change). The proportion of the dataset used for training versus testing had a minimal impact on accuracy, with changes of less than 8 % when varying the training data percentage between 10 % and 90 %, making 60 % a suitable compromise.
Defining the sampling unit as (1) point-based (randomly selected pings) or (2) boat trip-based (randomly selected boat trips) led to changes in accuracy of between 2.53 % and 3.99 %, depending on the model. Temporal resolution (ping rate) had a minimal effect on model performance, with changes of less than 2 % for intervals between 30 s (raw data with irregular time series) and 10 min (regular time series). As a post-processing step, replacing isolated data points with neighbouring values significantly enhanced the detection of fishing events, with improvements ranging from 80 % to 250 %, depending on the model. In conclusion, this study presents a straightforward procedure for selecting a machine learning method and enhancing its classification power using simple procedures. These approaches should be applied in all works using machine learning to produce fishing effort maps.
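The two simplest steps named in the abstract, smoothing speed with a moving average before classification and replacing isolated predictions afterwards, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `moving_average` and `replace_isolated` are hypothetical, the window is assumed to be centred (five points on each side, ten neighbours in total), and an "isolated" point is assumed to be a single label whose two immediate neighbours agree with each other but not with it.

```python
from statistics import mean


def moving_average(values, window=10):
    """Centred simple moving average (assumption: window // 2 neighbours
    on each side; the window shrinks at the start and end of the track)."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(mean(values[lo:hi]))
    return smoothed


def replace_isolated(labels):
    """Replace a label that differs from two agreeing neighbours
    (assumed definition of an isolated point) with the neighbours' value."""
    cleaned = list(labels)
    for i in range(1, len(labels) - 1):
        # Compare against the original labels so one fix cannot cascade.
        if labels[i - 1] == labels[i + 1] != labels[i]:
            cleaned[i] = labels[i - 1]
    return cleaned
```

In such a workflow, `moving_average` would be applied to the speed column of each boat trip before training, and `replace_isolated` to the sequence of predicted fishing (1) and non-fishing (0) labels of each trip after classification.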
Pages: 15