Improved deep learning framework for fish segmentation in underwater videos

被引:43
作者
Alshdaifat, Nawaf Farhan Funkur [1 ]
Talib, Abdullah Zawawi [1 ]
Osman, Mohd Azam [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town 11800, Malaysia
关键词
Deep learning; Instance segmentation; Fish detection;
D O I
10.1016/j.ecoinf.2020.101121
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Deep learning networks have become increasingly popular in recent years due to promising breakthroughs achieved in several areas. The importance of deep learning lies in the localisation and classification of an object based on frames. This study focuses on fish recognition methods in underwater videos and addresses the underlying challenges of these methods. It is important to develop effective methods to recognise fish and their movements using underwater videos. From a practical and scientific perspective, this is extremely useful to automatically recognise fish through their movement and to monitor and collect biomass in marine bodies. More importantly, it allows researchers to collect and analyse information related to the health and well-being of the Marine ecosystem. As most of the current methods work on static images, the issue arises when these methods are applied to images from videos. The existing multiple fish detection methods for underwater videos have a low detection rate due to the inherent underwater conditions such as the presence of coral reefs and other challenges which include the different sizes, shapes, colour, and speed of fish as well as marine behaviours such as the overlapping of fish. Therefore, the use of improved methods based on the latest deep learning algorithms has been proposed for multiple fish detection. This paper provides a novel framework for fish instance segmentation in underwater videos. The proposed model for improved recognition methods is composed of four main stages: 1) pre-processing method to reduce external factors in the videos for better detection and recognition of fish in underwater videos, 2) use of deep learning approach for enhanced detection of fish using RESENT, 3) enhanced detection of multiple fish based on the Region Proposal Network (RPN) architecture, and 4) use of a dynamic instance segmentation method. The results of this study indicate that the proposed framework has a better performance capability than other state-of-the-art models for multi-fish instance segmentation.
引用
收藏
页数:11
相关论文
共 27 条
[1]  
Abadi M.M., 2015, TENSORFLOW LARGE SCA, DOI 10.5555/3026877.3026899
[2]  
Aljarrah I.a., 2012, International Journal of Digital Information and Wireless Communications, V2, P156
[3]  
[Anonymous], 2014, ADV COMPUTER SCI ITS
[4]  
BackProp E., 1998, NEURAL NETWORKS TRIC
[5]   YOLACT Real-time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9156-9165
[6]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[7]  
Fisher RB, 2016, INTEL SYST REF LIBR, V104, P1, DOI 10.1007/978-3-319-30208-9_1
[8]  
GIRSHICK R, 2014, PROC CVPR IEEE, P580, DOI DOI 10.1109/CVPR.2014.81
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]  
Gschwandtner M, 2011, ADV VISUAL COMPUTING, P199, DOI [10.1007/978-3-642-24031-7_20, DOI 10.1007/978-3-642-24031-7_20]