Target detection algorithm based on multilayer attention mechanism-adaptive feature fusion network

Cited by: 3

Authors:

An, Fengping [1]

Wang, Jianrong [2,3]

Affiliations:

[1] Huaiyin Normal Univ, Sch Phys & Elect Elect Engn, Huaian 223300, Peoples R China

[2] Shanxi Univ, Sch Math Sci, Taiyuan 030006, SX, Peoples R China

[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China

Source:

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2023 / Vol. 14 / No. 08

Funding:

China Postdoctoral Science Foundation;

Keywords:

Target Detection; Multi-layer Attention Mechanism; Adaptive Feature Fusion Network; Node Attention; Feature Reinforcement; OBJECT DETECTION;

DOI:

10.1007/s13042-023-01791-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In real scenes, target images are complex and diverse because of variations in scale, occlusion, and appearance, which degrades the performance of target detection algorithms. As a result, existing target detection algorithms suffer from two problems. On the one hand, the neurons in the detection architecture cannot learn the complex interactions and semantic features within the target image. On the other hand, the feature expression of different target images is insufficient, and channel reduction leads to the loss of position information, among other problems. Therefore, a multi-layer attention mechanism that considers both node-level and semantic-level attention in the model architecture is proposed. In this method, the fusion of neighbor and semantic information is weighted, and node representations are learned in a hierarchical aggregation manner. This improves the effectiveness and interpretability of the model and addresses the problem of acquiring complex interactions and rich semantic features within images. Furthermore, we propose an adaptive feature fusion network that adaptively filters out useless information from other layers and retains the feature information that is beneficial to target recognition. A feature enhancement module is introduced to enhance the identifiability of the top-level target features of the feature network, which alleviates the loss of target position information. Finally, extensive tests on the PASCAL VOC and MSCOCO datasets show that our method not only achieves the best recognition performance but also has better stability and robustness.

References

Pages: 2685-2695

Page count: 11

43 entries in total

[21] Rapid object detection using a boosted cascade of simple features [J]. Viola, P; Jones, M. 2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001: 511-518

[22] Robust real-time face detection [J]. Viola, P; Jones, MJ. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 57(02): 137-154

[23] Scaled-YOLOv4: Scaling Cross Stage Partial Network [J]. Wang, Chien-Yao; Bochkovskiy, Alexey; Liao, Hong-Yuan Mark. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 13024-13033

[24] Region Proposal by Guided Anchoring [J]. Wang, Jiaqi; Chen, Kai; Yang, Shuo; Loy, Chen Change; Lin, Dahua. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 2960-2969

[25] An HOG-LBP Human Detector with Partial Occlusion Handling [J]. Wang, Xiaoyu; Han, Tony X.; Yan, Shuicheng. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009: 32-39

[26] Towards Universal Object Detection by Domain Attention [J]. Wang, Xudong; Cai, Zhaowei; Gao, Dashan; Vasconcelos, Nuno. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 7281-7290

[27] Video SAR Moving Target Detection Using Dual Faster R-CNN [J]. Wen, Liwu; Ding, Jinshan; Loffeld, Otmar. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14: 2984-2994

[28] EDN: Salient Object Detection via Extremely-Downsampled Network [J]. Wu, Yu-Huan; Liu, Yun; Zhang, Le; Cheng, Ming-Ming; Ren, Bo. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31: 3125-3136

[29] Detection and localization for lake floating objects based on CA-faster R-CNN [J]. Yi, Zeren; Yao, Dongyi; Li, Guojin; Ai, Jiaoyan; Xie, Wei. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81(12): 17263-17281

[30] Research on highway vehicle detection based on faster R-CNN and domain adaptation [J]. Yin, Guanxiang; Yu, Meng; Wang, Meng; Hu, Yong; Zhang, Yuejin. APPLIED INTELLIGENCE, 2022, 52(04): 3483-3498
