DENS-YOLOv6: a small object detection model for garbage detection on water surface

被引:16
作者
Li, Ning [1 ,2 ]
Wang, Mingliang [1 ]
Yang, Gaochao [1 ]
Li, Bo [1 ,3 ]
Yuan, Baohua [1 ]
Xu, Shoukun [1 ]
机构
[1] Changzhou Univ, Sch Comp Sci & Artificial Intelligence, Aliyun Sch Big Data, Sch Software, Changzhou 213164, Peoples R China
[2] Hohai Univ, Sch Comp & Informat Engn, Nanjing 210098, Peoples R China
[3] Changzhou Univ, Jiangsu Petrochem Proc Key Equipment Digital Twin, Changzhou 213164, Peoples R China
关键词
Small object detection; Water surface garbage detection; YOLOv6; Detail information enhancement; Adaptive noise suppression;
D O I
10.1007/s11042-023-17679-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The study of garbage detection onwater surface is of great significance for the development of water surface garbage monitoring and automated water surface garbage salvage. However, in water surface garbage scenes, the proportion of water background is relatively large, while the proportion of detection objects is relatively small. Moreover, the objects are easily affected by noise interference such as lighting, water waves, and reflections, which makes it difficult to extract object features and affects detection accuracy. In this paper, we propose a Detail Enhancement Noise Suppression YOLOv6 (DENS-YOLOv6) detection algorithm based on YOLOv6. Firstly, to better capture the detailed feature information of small objects, we design a Detail Information Enhancement Module (DIEM) based on atrous convolution. Secondly, to suppress noise interference on small objects, we develop an Adaptive Noise Suppression Module (ANSM). Finally, in order to improve the stability and convergence speed of the model training, we employ a regression loss function based on the Normalized Wasserstein Distance(NWD) metric. Experiments were conducted on the Flow+ dataset with a large number of small objects and the publicly available Pascal VOC2007 dataset. The mAPS indicators reached 40.6% and 11.4%, respectively. Compared with other models, DENS-YOLOv6 achieved the highest small object detection accuracy
引用
收藏
页码:55751 / 55771
页数:21
相关论文
共 55 条
  • [1] Few-Shot Object Detection: A Survey
    Antonelli, Simone
    Avola, Danilo
    Cinque, Luigi
    Crisostomi, Donato
    Foresti, Gian Luca
    Galasso, Fabio
    Marini, Marco Raoul
    Mecca, Alessio
    Pannone, Daniele
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [2] SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network
    Bai, Yancheng
    Zhang, Yongqiang
    Ding, Mingli
    Ghanem, Bernard
    [J]. COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 210 - 226
  • [3] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [4] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [5] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [6] FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters
    Cheng, Yuwei
    Zhu, Jiannan
    Jiang, Mengxin
    Fu, Jie
    Pang, Changsong
    Wang, Peidong
    Sankaran, Kris
    Onabola, Olawale
    Liu, Yimin
    Liu, Dianbo
    Bengio, Yoshua
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10933 - 10942
  • [7] Illumination-Guided Transformer-Based Network for Multispectral Pedestrian Detection
    Chu, Fuchen
    Cao, Jiale
    Shao, Zhuang
    Pang, Yanwei
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 343 - 355
  • [8] RepVGG: Making VGG-style ConvNets Great Again
    Ding, Xiaohan
    Zhang, Xiangyu
    Ma, Ningning
    Han, Jungong
    Ding, Guiguang
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737
  • [9] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [10] Gao A, 2022, IEEE Trans Circ Syst Vid Technol