Classification of Herring, Salmon, and Bubbles in Multifrequency Echograms Using U-Net Neural Networks

被引:7
作者
Slonimer, Alex L. [1 ]
Dosso, Stan E. [1 ]
Albu, Alexandra Branzan [2 ]
Cote, Melissa [2 ]
Marques, Tunai Porto [2 ]
Rezvanifar, Alireza [2 ]
Ersahin, Kaan [3 ]
Mudge, Todd
Gauthier, Stephane [4 ]
机构
[1] Univ Victoria, Sch Earth & Ocean Sci, Victoria, BC V8P 5C2, Canada
[2] Univ Victoria, Elect & Comp Engn, Victoria, BC V8P 5C2, Canada
[3] ASL Environm Sci, Victoria, BC V8M 1Z5, Canada
[4] Fisheries & Oceans Canada, Sydney, NS B1P 6J9, Canada
关键词
Aquaculture; deep learning; oceans; semantic segmentation; underwater acoustics; ACOUSTIC CLASSIFICATION; FISH-SCHOOLS; BACKSCATTERING; IDENTIFICATION; SYSTEM; SOUND;
D O I
10.1109/JOE.2023.3272393
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Echosounders are used by fisheries and ocean observatories, but significant manual effort is required to classify species of interest within multifrequency echograms. This article investigates the use of modified U-Net convolutional neural networks for the pixel-level classification of biological and physical data in echogram imageswith accurate classification of herring and salmon schools, bubbles, and the sea surface. Data were collected on the coast of British Columbia, Canada, over two years using an Acoustic Zooplankton and Fish Profiler at four frequencies (67, 125, 200, 455 kHz). In addition, simulated data (water depth and solar elevation angle) provide spatial and temporal context to improve the quality of predictions. Redundancy is built into the model by using a tiling strategy during training and classification. During training, using a limited set of annotated data, translational augmentation encodes theU-Netswith robust features that enable applications for alternate deployment configurations (lower sampling rates or alternate water depths). To ensure broad applicability, these networks were trained to classify echograms with noise left intact. The best-performing model classifies herring, salmon, and bubble classes with F1 scores of 93.0%, 87.3%, and 86.5%, respectively. The results are accurate even when multiple classes are in close proximity, thus, retaining biological data that would otherwise be discarded due to surface bubble noise.
引用
收藏
页码:1236 / 1254
页数:19
相关论文
共 54 条
[41]  
Rousseau S., 2017, Canadian Technical Report of Fisheries and Aquatic Sciences, V3227, pI
[42]  
Rousseau S., 2017, 3227 BRIT COL CAN TE, V3227, P1
[43]   Acoustic classification of coexisting taxa in a coastal ecosystem [J].
Sato, Mei ;
Horne, John K. ;
Parker-Stetter, Sandra L. ;
Keister, Julie E. .
FISHERIES RESEARCH, 2015, 172 :130-136
[44]   A survey on Image Data Augmentation for Deep Learning [J].
Shorten, Connor ;
Khoshgoftaar, Taghi M. .
JOURNAL OF BIG DATA, 2019, 6 (01)
[45]  
Simmonds E.J., 2005, FISHERIES ACOUSTICS
[46]   SIMPLE APPROXIMATE FORMULAS FOR BACKSCATTERING OF SOUND BY SPHERICAL AND ELONGATED OBJECTS [J].
STANTON, TK .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (04) :1499-1510
[47]   ON ACOUSTIC ESTIMATES OF ZOOPLANKTON BIOMASS [J].
STANTON, TK ;
WIEBE, PH ;
CHU, DZ ;
BENFIELD, MC ;
SCANLON, L ;
MARTIN, L ;
EASTWOOD, RL .
ICES JOURNAL OF MARINE SCIENCE, 1994, 51 (04) :505-512
[48]   Generalised Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations [J].
Sudre, Carole H. ;
Li, Wenqi ;
Vercauteren, Tom ;
Ourselin, Sebastien ;
Cardoso, M. Jorge .
DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 :240-248
[49]   Detecting Submerged Objects Using Active Acoustics and Deep Neural Networks: A Test Case for Pelagic Fish [J].
Testolin, Alberto ;
Kipnis, Dror ;
Diamant, Roee .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (08) :2776-2788
[50]  
Tiu E., 2020, Understanding latent space in machine learning