A real-time object detection method for underwater complex environments based on FasterNet-YOLOv7

被引:0
作者
Qing Yang
Huijuan Meng
Yuchen Gao
Dexin Gao
机构
[1] Qingdao University of Science and Technology,School of Information Science and Technology
[2] Qingdao University of Science and Technology,School of Automation and Electronic Engineering
来源
Journal of Real-Time Image Processing | 2024年 / 21卷
关键词
Underwater object detection; UWCNN; FasterNet-YOLOv7; Focal-EIOU loss; Complex environments;
D O I
暂无
中图分类号
学科分类号
摘要
A FasterNet-You Only Look Once (YOLO)v7 algorithm is proposed for underwater complex environments with blurred images and complex backgrounds, which lead to difficulties in object target feature extraction and target miss detection, and to improve the fusion capability and real-time detection of small underwater targets. Before training the improved model, the original images acquired by the underwater robot are preprocessed in combination with the Underwater Image Enhancement Convolutional Neural Network (UWCNN) algorithm, which helps to identify targets accurately in the complex marine environment. First, to extract spatial features more efficiently, the algorithm uses Faster Neural Networks (FasterNet-L) as the backbone network model as well as an improved loss function, Focal Efficient Intersection over Union Loss (Focal-EIOU Loss), to reduce redundant computations and memory access, and the regression process focuses on high-quality anchor frames. Second, for the problem of poor robustness of small targets in an underwater environment, the algorithm uses the Cross-modal Transformer Attention (CoTAttention) lightweight attention mechanism to improve the original algorithm so that the detection targets are enhanced in channel and spatial dimensions. Finally, the experimental results show that the mean average precision (mAP) value of this paper's algorithm reaches 91.8% and the actual detection video frame rate reaches 83.21. FasterNet-YOLOv7 has higher detection accuracy compared with Faster Region-Based Convolutional Neural Network (Faster RCNN), Single Shot MultiBox Detection (SSD), YOLOv4, YOLOv5, and YOLOv7 models and is more accurate.
引用
收藏
相关论文
共 34 条
  • [31] TTPRNet: A Real-Time and Precise Tea Tree Pest Recognition Model in Complex Tea Garden Environments
    Li, Yane
    Chen, Ting
    Xia, Fang
    Feng, Hailin
    Ruan, Yaoping
    Weng, Xiang
    Weng, Xiaoxing
    AGRICULTURE-BASEL, 2024, 14 (10):
  • [32] Real-Time Relative Positioning Study of an Underwater Bionic Manta Ray Vehicle Based on Improved YOLOx
    Zhao, Qiaoqiao
    Zhang, Lichuan
    Zhu, Yuchen
    Liu, Lu
    Huang, Qiaogao
    Cao, Yong
    Pan, Guang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (02)
  • [33] A Complex Environmental Water-Level Detection Method Based on Improved YOLOv5m
    Li, Jiadong
    Tong, Chunya
    Yuan, Hongxing
    Huang, Wennan
    SENSORS, 2024, 24 (16)
  • [34] Real-time detection of uneaten feed pellets in underwater images for aquaculture using an improved YOLO-V4 network
    Hu, Xuelong
    Liu, Yang
    Zhao, Zhengxi
    Liu, Jintao
    Yang, Xinting
    Sun, Chuanheng
    Chen, Shuhan
    Li, Bin
    Zhou, Chao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 185