Composited FishNet: Fish Detection and Species Recognition From Low-Quality Underwater Videos

Cited by: 102
Authors
Zhao, Zhenxi [1, 2, 3]
Liu, Yang [1, 2, 3]
Sun, Xudong [4]
Liu, Jintao [5]
Yang, Xinting [1, 2, 3]
Zhou, Chao [1, 2, 3]
Affiliations
[1] Natl Engn Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
[2] Beijing Res Ctr Informat Technol Agr, Beijing 100097, Peoples R China
[3] Natl Engn Lab Agriprod Qual Traceabil, Beijing 100097, Peoples R China
[4] East China Jiaotong Univ, Sch Mechatron Engn, Nanchang 330013, Jiangxi, Peoples R China
[5] Univ Almeria, Sch Engn, Almeria 04120, Spain
Funding
Beijing Natural Science Foundation;
Keywords
Fish; Feature extraction; Object detection; Task analysis; Detectors; Data mining; Object recognition; Composite backbone network; composited FishNet; fish detection; feature fusion; underwater videos;
DOI
10.1109/TIP.2021.3074738
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The automatic detection and identification of fish from underwater videos is of great significance for fishery resource assessment and ecological environment monitoring. However, due to the poor quality of underwater images and unconstrained fish movement, traditional hand-designed feature extraction methods and convolutional neural network (CNN)-based object detection algorithms cannot meet the detection requirements in real underwater scenes. Therefore, to realize fish recognition and localization in a complex underwater environment, this paper proposes a novel composite fish detection framework, called Composited FishNet, based on a composite backbone and an enhanced path aggregation network. By improving the residual network (ResNet), a new composite backbone network (CBresnet) is designed to learn the scene change information (source domain style), which is caused by differences in image brightness, fish orientation, seabed structure, aquatic plant movement, and the shape and texture of fish species. Thus, the interference of underwater environmental information on the object characteristics is reduced, and the output of the main network to the object information is strengthened. In addition, to better integrate the high-level and low-level feature information output from CBresnet, the enhanced path aggregation network (EPANet) is also designed to solve the insufficient utilization of semantic information caused by linear upsampling. The experimental results show that the average precision AP_{0.5:0.95}, AP_{50} and average recall AR_{max=10} of the proposed Composited FishNet are 75.2%, 92.8% and 81.1%, respectively. The composite backbone network enhances the characteristic information output of the detected object and improves the utilization of characteristic information. This method can be used for fish detection and identification in complex underwater environments such as oceans and aquaculture.
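The metric names in the abstract (average precision over 0.5:0.95, AP at 50, average recall with at most 10 detections) resemble the COCO-style detection metrics, where a predicted box counts as a match only if its intersection over union (IoU) with a ground-truth box reaches a threshold, and precision is averaged over the thresholds 0.50, 0.55, ..., 0.95. This record does not state the evaluation protocol, so the following is only a minimal sketch under that assumption; the names `iou`, `IOU_THRESHOLDS` and `thresholds_passed` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max) in pixels (assumed convention).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Assumed COCO-style threshold set: 0.50, 0.55, ..., 0.95 (ten values).
IOU_THRESHOLDS: List[float] = [round(0.50 + 0.05 * i, 2) for i in range(10)]


def thresholds_passed(pred: Box, gt: Box) -> List[float]:
    """Thresholds at which `pred` would count as a match for `gt`."""
    overlap = iou(pred, gt)
    return [t for t in IOU_THRESHOLDS if overlap >= t]


if __name__ == "__main__":
    predicted = (0.0, 0.0, 10.0, 5.0)
    ground_truth = (0.0, 0.0, 10.0, 10.0)
    print(iou(predicted, ground_truth))
    print(thresholds_passed(predicted, ground_truth))
```

Under this convention, a loosely localized box can still count toward the single-threshold figure at IoU 0.5 while failing the stricter thresholds, which is why the averaged figure is typically lower than the single-threshold one.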
Pages: 4719-4734
Page count: 16