Addressing class imbalance in deep learning for acoustic target classification

被引:7
作者
Pala, Ahmet [1 ]
Oleynik, Anna [1 ]
Utseth, Ingrid [2 ]
Handegard, Nils Olav [3 ]
机构
[1] Univ Bergen, Dept Math, Allegaten 41, N-5008 Bergen, Norway
[2] Norwegian Comp Ctr, POb 114, N-0314 Oslo, Norway
[3] Inst Marine Res, Nordnesgaten 50, N-5005 Bergen, Norway
关键词
acoustic target classification; big data; class imbalance; deep learning; similarity-based sampling; SPECIES IDENTIFICATION; NEURAL-NETWORKS; FISHERIES;
D O I
10.1093/icesjms/fsad165
中图分类号
S9 [水产、渔业];
学科分类号
0908 ;
摘要
Acoustic surveys provide important data for fisheries management. During the surveys, ship-mounted echo sounders send acoustic signals into the water and measure the strength of the reflection, so-called backscatter. Acoustic target classification (ATC) aims to identify backscatter signals by categorizing them into specific groups, e.g. sandeel, mackerel, and background (as bottom and plankton). Convolutional neural networks typically perform well for ATC but fail in cases where the background class is similar to the foreground class. In this study, we discuss how to address the challenge of class imbalance in the sampling of training and validation data for deep convolutional neural networks. The proposed strategy seeks to equally sample areas containing all different classes while prioritizing background data that have similar characteristics to the foreground class. We investigate the performance of the proposed sampling methodology for ATC using a previously published deep convolutional neural network architecture on sandeel data. Our results demonstrate that utilizing this approach enables accurate target classification even when dealing with imbalanced data. This is particularly relevant for pixel-wise semantic segmentation tasks conducted on extensive datasets. The proposed methodology utilizes state-of-the-art deep learning techniques and ensures a systematic approach to data balancing, avoiding ad hoc methods.
引用
收藏
页码:2530 / 2544
页数:15
相关论文
共 61 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]   Boosted Near-miss Under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets [J].
Bao, Lei ;
Juan, Cao ;
Li, Jintao ;
Zhang, Yongdong .
NEUROCOMPUTING, 2016, 172 :198-206
[3]  
Barandela R, 2004, LECT NOTES COMPUT SC, V3138, P806
[4]   Classifying imbalanced data sets using similarity based hierarchical decomposition [J].
Beyan, Cigdem ;
Fisher, Robert .
PATTERN RECOGNITION, 2015, 48 (05) :1653-1672
[5]  
Birnie C, 2021, GEOPHYSICS, V86, pKS151, DOI [10.1190/geo2021-0130.1, 10.1190/GEO2021-0130.1]
[6]   Colour maps for fisheries acoustic echograms [J].
Blackwell, Robert E. ;
Harvey, Richard ;
Queste, Bastien Y. ;
Fielding, Sophie .
ICES JOURNAL OF MARINE SCIENCE, 2020, 77 (02) :826-834
[7]   Neighbourhood sampling in bagging for imbalanced data [J].
Blaszczynski, Jerzy ;
Stefanowski, Jerzy .
NEUROCOMPUTING, 2015, 150 :529-542
[8]   Acoustic classification in multifrequency echosounder data using deep convolutional neural networks [J].
Brautaset, Olav ;
Waldeland, Anders Ueland ;
Johnsen, Espen ;
Malde, Ketil ;
Eikvil, Line ;
Salberg, Arnt-Borre ;
Handegard, Nils Olav .
ICES JOURNAL OF MARINE SCIENCE, 2020, 77 (04) :1391-1400
[9]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)