Feature Learning Improved by Location Guidance and Supervision for Object Detection

被引:1
作者
Li, Bingying [1 ,2 ]
Xiong, Jiale [1 ,3 ]
Fu, Xiang [1 ,3 ]
Zeng, Jiexian [1 ,2 ,3 ]
Leng, Lu [1 ,3 ,4 ]
机构
[1] Nanchang Hangkong Univ, Key Lab Jiangxi Prov Image Proc & Pattern Recogni, Nanchang 330063, Jiangxi, Peoples R China
[2] Nanchang Hangkong Univ, Sci & Technol Coll, Nanchang, Jiangxi, Peoples R China
[3] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Jiangxi, Peoples R China
[4] Yonsei Univ, Sch Elect & Elect Engn, Coll Engn, Seoul 120749, South Korea
基金
中国国家自然科学基金;
关键词
Feature extraction; Detectors; Object detection; Convolution; Semantics; Data mining; Head; feature alignment; multiple detection; consistency supervision;
D O I
10.1109/ACCESS.2021.3110888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the single-stage detectors have been developed rapidly; however, compared with the multi-stage detectors, their detection precision is still relatively low. Single-stage detectors and multi-stage detectors are analyzes and compared in detail in this paper, which reveals that single-stage detectors suffer from some problems, including feature loss and inaccurate feature extraction. Therefore, this paper proposes a novel detection model, dubbed Optimized Network (OptNet), to alleviate these deficiencies. OptNet consists of three modules: pyramid of attention features, feature alignment and consistency supervision (CS). The pyramid of attention features, based on feature pyramid networks (FPNs), introduces a novel branch named attention FPN (AtFPN), which aggregates the multi-layer features of the backbone network and optimizes the object features by using lightweight attention modules. AtFPN alleviates the loss of the feature pyramid information and the blocking of feature transmission between adjacent layers. Meanwhile, it provides global information for the model. The feature alignment module aligns the anchor box to the feature by using the object location information to guide the network to extract precise object features. Finally, CS accelerates network optimization and reduces semantic differences between the features on different layers. In the detection stage, OptNet optimizes the prediction of the model with the first detection result to improve the accuracy. Experiments on the MS COCO 2017 dataset demonstrate that OptNet yields significant improvement in the detection precision.
引用
收藏
页码:133335 / 133345
页数:11
相关论文
共 50 条
[41]   Improved domain adaptive object detector via adversarial feature learning [J].
Marnissi, Mohamed Amine ;
Fradi, Hajer ;
Sahbani, Anis ;
Ben Amara, Najoua Essoukri .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 230
[42]   Mutual-Assistance Learning for Object Detection [J].
Xie, Xingxing ;
Lang, Chunbo ;
Miao, Shicheng ;
Cheng, Gong ;
Li, Ke ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) :15171-15184
[43]   Cross-Domain Adaptive Object Detection Based on Refined Knowledge Transfer and Mined Guidance in Autonomous Vehicles [J].
Wang, Ke ;
Pu, Liang ;
Dong, Wenjie .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01) :1899-1908
[44]   Generalized Feature Learning for Detection of Novel Objects [J].
Liu, Jierui ;
Liu, Xilong ;
Cao, Zhiqiang ;
Yu, Junzhi ;
Tan, Min .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (01) :388-395
[45]   Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection [J].
Wang, Chaowei ;
Guo, Guangqian ;
Liu, Chang ;
Shao, Dian ;
Gao, Shan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-14
[46]   Multiscale Semantic Guidance Network for Object Detection in VHR Remote Sensing Images [J].
Zhu, Shengyu ;
Zhang, Junping ;
Liang, Xuejian ;
Guo, Qingle .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[47]   UW-DETR: Feature Fusion Enhanced RT-DETR for Improving Underwater Object Detection [J].
Li, Xingkun ;
Wang, Yugang ;
Zhao, Yuhao ;
Chen, Guodong .
IEEE ACCESS, 2024, 12 :191967-191979
[48]   Object Detection for Aerial Images With Feature Enhancement and Soft Label Assignment [J].
Yu, Ying ;
Yang, Xi ;
Li, Jie ;
Gao, Xinbo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[49]   Self-Supervised Feature Augmentation for Large Image Object Detection [J].
Pan, Xingjia ;
Tang, Fan ;
Dong, Weiming ;
Gu, Yang ;
Song, Zhichao ;
Meng, Yiping ;
Xu, Pengfei ;
Deussen, Oliver ;
Xu, Changsheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :6745-6758
[50]   Multilevel Spatial-Temporal Feature Aggregation for Video Object Detection [J].
Xu, Chao ;
Zhang, Jiangning ;
Wang, Mengmeng ;
Tian, Guanzhong ;
Liu, Yong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) :7809-7820