Feature Learning Improved by Location Guidance and Supervision for Object Detection

被引:1
作者
Li, Bingying [1 ,2 ]
Xiong, Jiale [1 ,3 ]
Fu, Xiang [1 ,3 ]
Zeng, Jiexian [1 ,2 ,3 ]
Leng, Lu [1 ,3 ,4 ]
机构
[1] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recogni, Nanchang 330063, Jiangxi, Peoples R China
[2] Nanchang Hangkong Univ, Sci & Technol Coll, Nanchang, Jiangxi, Peoples R China
[3] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Jiangxi, Peoples R China
[4] Yonsei Univ, Sch Elect & Elect Engn, Coll Engn, Seoul 120749, South Korea
基金
中国国家自然科学基金;
关键词
Feature extraction; Detectors; Object detection; Convolution; Semantics; Data mining; Head; feature alignment; multiple detection; consistency supervision;
D O I
10.1109/ACCESS.2021.3110888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the single-stage detectors have been developed rapidly; however, compared with the multi-stage detectors, their detection precision is still relatively low. Single-stage detectors and multi-stage detectors are analyzes and compared in detail in this paper, which reveals that single-stage detectors suffer from some problems, including feature loss and inaccurate feature extraction. Therefore, this paper proposes a novel detection model, dubbed Optimized Network (OptNet), to alleviate these deficiencies. OptNet consists of three modules: pyramid of attention features, feature alignment and consistency supervision (CS). The pyramid of attention features, based on feature pyramid networks (FPNs), introduces a novel branch named attention FPN (AtFPN), which aggregates the multi-layer features of the backbone network and optimizes the object features by using lightweight attention modules. AtFPN alleviates the loss of the feature pyramid information and the blocking of feature transmission between adjacent layers. Meanwhile, it provides global information for the model. The feature alignment module aligns the anchor box to the feature by using the object location information to guide the network to extract precise object features. Finally, CS accelerates network optimization and reduces semantic differences between the features on different layers. In the detection stage, OptNet optimizes the prediction of the model with the first detection result to improve the accuracy. Experiments on the MS COCO 2017 dataset demonstrate that OptNet yields significant improvement in the detection precision.
引用
收藏
页码:133335 / 133345
页数:11
相关论文
共 50 条
[31]   Feature-Aligned Single-Stage Rotation Object Detection With Continuous Boundary [J].
Yuan, Yuan ;
Li, Zhiguo ;
Ma, Dandan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[32]   Multi-Scale Information Interaction and Feature Pyramid Network for Salient Object Detection [J].
Fang, Jie ;
Zhang, Zhicheng ;
Zhang, Shasha .
IEEE ACCESS, 2025, 13 :106435-106442
[33]   Efficient Feature Focus Enhanced Network for Small and Dense Object Detection in SAR Images [J].
Li, Cong ;
Xi, Lihu ;
Hei, Yongqiang ;
Li, Wentao ;
Xiao, Zhu .
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 :1306-1310
[34]   AEFFNet: Attention Enhanced Feature Fusion Network for Small Object Detection in UAV Imagery [J].
Nian, Zhaoyu ;
Yang, Wenzhu ;
Chen, Hao .
IEEE ACCESS, 2025, 13 :26494-26505
[35]   FRLI-Net: Feature Reconstruction and Learning Interaction Network for Tiny Object Detection in Remote Sensing Images [J].
Chen, Penglei ;
Wang, Jiangtao ;
Zhang, Zhiwei ;
He, Cheng .
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 :2159-2163
[36]   Boost UAV-Based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning [J].
Liu, Fan ;
Yao, Liang ;
Zhang, Chuanyi ;
Wu, Ting ;
Zhang, Xinlei ;
Jiang, Xiruo ;
Zhou, Jun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[37]   SSRDet: Small Object Detection Based on Feature Pyramid Network [J].
Zhang, Lijuan ;
Wang, Minhui ;
Jiang, Yutong ;
Li, Dongming ;
Zhou, Yue .
IEEE ACCESS, 2023, 11 :96743-96752
[38]   Improving Single Shot Object Detection With Feature Scale Unmixing [J].
Li, Yazhao ;
Pang, Yanwei ;
Cao, Jiale ;
Shen, Jianbing ;
Shao, Ling .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :2708-2721
[39]   Reinforced Neighbour Feature Fusion Object Detection with Deep Learning [J].
Wang, Ningwei ;
Li, Yaze ;
Liu, Hongzhe .
SYMMETRY-BASEL, 2021, 13 (09)
[40]   Enhanced Spatial Feature Learning for Weakly Supervised Object Detection [J].
Wu, Zhihao ;
Wen, Jie ;
Xu, Yong ;
Yang, Jian ;
Li, Xuelong ;
Zhang, David .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) :961-972