Dual Stream Encoder-Decoder Architecture with Feature Fusion Model for Underwater Object Detection

被引:1
作者
Nissar, Mehvish [1 ]
Mishra, Amit Kumar [2 ]
Subudhi, Badri Narayan [1 ]
机构
[1] Indian Inst Technol Jammu, Dept Elect Engn, Jammu 181221, India
[2] Aberystwyth Univ, Fac Business & Phys Sci, Aberystwyth SY23 3FL, Wales
关键词
underwater surveillance; object detection; deep learning; CNN; background subtraction; video surveillance; foreground segmentation; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.3390/math12203227
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Underwater surveillance is an imminent and fascinating exploratory domain, particularly in monitoring aquatic ecosystems. This field offers valuable insights into underwater behavior and activities, which have broad applications across various domains. Specifically, underwater surveillance involves detecting and tracking moving objects within aquatic environments. However, the complex properties of water make object detection a challenging task. Background subtraction is a commonly employed technique for detecting local changes in video scenes by segmenting images into the background and foreground to isolate the object of interest. Within this context, we propose an innovative dual-stream encoder-decoder framework based on the VGG-16 and ResNet-50 models for detecting moving objects in underwater frames. The network includes a feature fusion module that effectively extracts multiple-level features. Using a limited set of images and performing training in an end-to-end manner, the proposed framework yields accurate results without post-processing. The efficacy of the proposed technique is confirmed through visual and quantitative comparisons with eight cutting-edge methods using two standard databases. The first one employed in our experiments is the Underwater Change Detection Dataset, which includes five challenges, each challenge comprising approximately 1000 frames. The categories in this dataset were recorded under various underwater conditions. The second dataset used for practical analysis is the Fish4Knowledge dataset, where we considered five challenges. Each category, recorded in different aquatic settings, contains a varying number of frames, typically exceeding 1000 per category. Our proposed method surpasses all methods used for comparison by attaining an average F-measure of 0.98 on the Underwater Change Detection Dataset and 0.89 on the Fish4Knowledge dataset.
引用
收藏
页数:22
相关论文
共 73 条
  • [1] A deep convolutional neural network for video sequence background subtraction
    Babaee, Mohammadreza
    Duc Tung Dinh
    Rigoll, Gerhard
    [J]. PATTERN RECOGNITION, 2018, 76 : 635 - 649
  • [2] Bajpai V, 2021, OCEANS 2021: SAN DIEGO - PORTO
  • [3] Baojie Fan, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12365), P275, DOI 10.1007/978-3-030-58565-5_17
  • [4] ViBe: A Universal Background Subtraction Algorithm for Video Sequences
    Barnich, Olivier
    Van Droogenbroeck, Marc
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (06) : 1709 - 1724
  • [5] Braham M, 2017, IEEE IMAGE PROC, P4552, DOI 10.1109/ICIP.2017.8297144
  • [6] Total Variation Regularized Tensor RPCA for Background Subtraction From Compressive Measurements
    Cao, Wenfei
    Wang, Yao
    Sun, Jian
    Meng, Deyu
    Yang, Can
    Cichocki, Andrzej
    Xu, Zongben
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (09) : 4075 - 4090
  • [7] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [8] HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection
    Chen, Gangqi
    Mao, Zhaoyong
    Wang, Kai
    Shen, Junge
    [J]. REMOTE SENSING, 2023, 15 (04)
  • [9] Research on an Underwater Object Detection Network Based on Dual-Branch Feature Extraction
    Chen, Xiao
    Yuan, Mujiahui
    Fan, Chenye
    Chen, Xingwu
    Li, Yaan
    Wang, Haiyan
    [J]. ELECTRONICS, 2023, 12 (16)
  • [10] RailFOD23: A dataset for foreign object detection on railroad transmission lines
    Chen, Zhichao
    Yang, Jie
    Feng, Zhicheng
    Zhu, Hao
    [J]. SCIENTIFIC DATA, 2024, 11 (01)