A Vision Enhancement and Feature Fusion Multiscale Detection Network

被引:0
|
作者
Chengwu Qian
Jiangbo Qian
Chong Wang
Xulun Ye
Caiming Zhong
机构
[1] Ningbo University,
来源
Neural Processing Letters | / 56卷
关键词
Object detection; Object occlusion; Swin transformer; Vision enhancement;
D O I
暂无
中图分类号
学科分类号
摘要
In the field of object detection, there is often a high level of occlusion in real scenes, which can very easily interfere with the accuracy of the detector. Currently, most detectors use a convolutional neural network (CNN) as a backbone network, but the robustness of CNNs for detection under cover is poor, and the absence of object pixels makes conventional convolution ineffective in extracting features, leading to a decrease in detection accuracy. To address these two problems, we propose VFN (A Vision Enhancement and Feature Fusion Multiscale Detection Network), which first builds a multiscale backbone network using different stages of the Swin Transformer, and then utilizes a vision enhancement module using dilated convolution to enhance the vision of feature points at different scales and address the problem of missing pixels. Finally, the feature guidance module enables features at each scale to be enhanced by fusing with each other. The total accuracy demonstrated by VFN on both the PASCAL VOC dataset and the CrowdHuman dataset is better than that of other methods, and its ability to find occluded objects is also better, demonstrating the effectiveness of our method.The code is available at https://github.com/qcw666/vfn.
引用
收藏
相关论文
共 50 条
  • [41] Attention feature fusion network for small traffic sign detection
    Wu, Miaozhi
    Yang, Jingmin
    Zhang, Wenjie
    Zheng, Yifeng
    Liao, Jianxin
    ENGINEERING RESEARCH EXPRESS, 2022, 4 (03):
  • [42] SSFENET: SPATIAL AND SEMANTIC FEATURE ENHANCEMENT NETWORK FOR OBJECT DETECTION
    Wang, Tianyuan
    Ma, Can
    Su, Haoshan
    Wang, Weiping
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1500 - 1504
  • [43] Small Object Detection Network Based on Feature Information Enhancement
    Luo, Huilan
    Wang, Pei
    Chen, Hongkun
    Kowelo, Vladimir Peter
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [44] Adaptively Attentional Feature Fusion Oriented to Multiscale Object Detection in Remote Sensing Images
    Zhao, Wenqing
    Kang, Yijin
    Chen, Hao
    Zhao, Zhenhuan
    Zhao, Zhenbing
    Zhai, Yongjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [45] BFEA: A SAR Ship Detection Model Based on Attention Mechanism and Multiscale Feature Fusion
    Zhou, Liming
    Wan, Ziye
    Zhao, Shuai
    Han, Hongyu
    Liu, Yang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11163 - 11177
  • [46] Multiscale feature cross-layer fusion remote sensing target detection method
    Lin, Yuting
    Zhang, Jianxun
    Huang, Jiaming
    IET SIGNAL PROCESSING, 2023, 17 (03)
  • [47] MSMA-Net: An Infrared Small Target Detection Network by Multiscale Super-Resolution Enhancement and Multilevel Attention Fusion
    Ma, Tianlei
    Wang, Hao
    Liang, Jing
    Peng, Jinzhu
    Ma, Qi
    Kai, Zhiqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 20
  • [48] SAR SHIP DETECTION BASED ON SWIN TRANSFORMER AND FEATURE ENHANCEMENT FEATURE PYRAMID NETWORK
    Ke, Xiao
    Zhang, Xiaoling
    Zhang, Tianwen
    Shi, Jun
    Wei, Shunjun
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2163 - 2166
  • [49] Multilayer attention receptive fusion network for multiscale ship detection with complex background
    Zhou, Weina
    Liu, Lu
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [50] MFF-Net: Multiscale feature fusion semantic segmentation network for intracranial surgical instruments
    Liu, Zhenzhong
    Zheng, Laiwang
    Yang, Shubin
    Zhong, Zichen
    Zhang, Guobin
    INTERNATIONAL JOURNAL OF MEDICAL ROBOTICS AND COMPUTER ASSISTED SURGERY, 2024, 20 (01):