Temperate fish detection and classification: a deep learning based approach

Cited by: 102
Authors
Knausgard, Kristian Muri [1]
Wiklund, Arne [2]
Sordalen, Tonje Knutsen [3, 4]
Halvorsen, Kim Tallaksen [5]
Kleiven, Alf Ring [3]
Jiao, Lei [2]
Goodwin, Morten [2]
Affiliations
[1] Univ Agder UiA, Dept Engn Sci, N-4879 Grimstad, Norway
[2] UiA, Ctr Artificial Intelligence Res, N-4879 Grimstad, Norway
[3] Inst Marine Res IMR, Flodevigen Res Stn, N-4817 His, Norway
[4] UiA, Ctr Coastal Res CCR, Dept Nat Sci, N-4630 Kristiansand, Norway
[5] Inst Marine Res IMR, Ecosyst Acoust Grp, Flodevigen Res Stn, N-4817 His, Norway
Keywords
Biometric fish classification; Temperate species; Deep learning; Object detection; CNN; Underwater video
DOI
10.1007/s10489-020-02154-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A wide range of applications in marine ecology extensively uses underwater cameras. Still, to efficiently process the vast amount of data generated, we need to develop tools that can automatically detect and recognize species captured on film. Classifying fish species from videos and images in natural environments can be challenging because of noise and variation in illumination and the surrounding habitat. In this paper, we propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering. The first step is to detect each single fish in an image, independent of species and sex. For this purpose, we employ the You Only Look Once (YOLO) object detection technique. In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture for classifying each fish in the image without pre-filtering. We apply transfer learning to overcome the limited training samples of temperate fishes and to improve the accuracy of the classification. This is done by training the object detection model with ImageNet and the fish classifier via a public dataset (Fish4Knowledge), whereupon both the object detection and classifier are updated with temperate fishes of interest. The weights obtained from pre-training are applied to post-training as a priori. Our solution achieves the state-of-the-art accuracy of 99.27% using the pre-training model. The accuracies using the post-training model are also high; 83.68% and 87.74% with and without image augmentation, respectively. This strongly indicates that the solution is viable with a more extensive dataset.
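The Squeeze-and-Excitation (SE) mechanism named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function name `se_block`, the nested-list tensor layout, the omission of bias terms, and the weight matrices `w1` and `w2` are all assumptions made for illustration. The block averages each channel (squeeze), passes the result through two fully connected layers with ReLU and sigmoid (excitation), and rescales each channel by its gate.

```python
import math


def se_block(feature_maps, w1, w2):
    """Apply a Squeeze-and-Excitation gate to a stack of feature maps.

    feature_maps: list of C channels, each an H x W nested list of floats.
    w1: hidden x C weight matrix (hypothetical, biases omitted).
    w2: C x hidden weight matrix (hypothetical, biases omitted).
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
        for ch in feature_maps
    ]
    # Excitation, step 1: fully connected layer followed by ReLU.
    hidden = [
        max(0.0, sum(w * z for w, z in zip(row, squeezed)))
        for row in w1
    ]
    # Excitation, step 2: fully connected layer followed by sigmoid,
    # producing one gate in (0, 1) per channel.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    return [
        [[v * g for v in row] for row in ch]
        for ch, g in zip(feature_maps, gates)
    ]


# Two 2x2 channels, one hidden unit (reduction from 2 channels to 1).
maps = [[[1.0, 1.0], [1.0, 1.0]], [[2.0, 2.0], [2.0, 2.0]]]
scaled = se_block(maps, w1=[[1.0, 1.0]], w2=[[0.0], [1.0]])
```

In a real classifier the weights would be learned, and the block would sit after a convolutional stage so that informative channels are emphasised before the next layer.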
Pages: 6988-7001
Number of pages: 14