ALFPN: Adaptive Learning Feature Pyramid Network for Small Object Detection

被引:4
作者
Chen, Haolin [1 ,2 ]
Wang, Qi [1 ,2 ]
Ruan, Weijian [3 ]
Zhu, Jingxiang [2 ]
Lei, Liang [2 ]
Wu, Xue [1 ]
Hao, Gefei [1 ]
机构
[1] Guizhou Univ, Coll Comp Sci & Technol, State Key Lab Publ Big Data, Guangzhou 550025, Guangdong, Peoples R China
[2] Guangdong Univ Technol, Sch Phys & Optoelect Engn, Guangzhou 510006, Guangdong, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection;
D O I
10.1155/2023/6266209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection has become a crucial technology in intelligent vision systems, enabling automatic detection of target objects. While most detectors perform well on open datasets, they often struggle with small-scale objects. This is due to the traditional top-down feature fusion methods that weaken the semantic and location information of small objects, leading to poor classification performance. To address this issue, we propose a novel feature pyramid network, the adaptive learnable feature pyramid network (ALFPN). Our approach features an adaptive feature inspection that incorporates learnable fusion coefficients in the fusion of different levels of feature layers, aiding the network in learning features with less noise. In addition, we construct a context-aligned supervisor that adjusts the feature maps fused at different levels to avoid scaling-related offset effects. Our experiments demonstrate that our method achieves state-of-the-art results and is highly robust for the small object detection on the TT-100K, PASCAL VOC, and COCO datasets. These findings indicate that a model's ability to extract discriminant features is positively correlated with its performance in detecting small objects.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Improving cross-dimensional weighting pooling with multi-scale feature fusion for image retrieval
    Wang, Qi
    Lai, Jinxiang
    Yang, Zhenguo
    Xu, Kai
    Kan, Peipei
    Liu, Wenyin
    Lei, Liang
    [J]. NEUROCOMPUTING, 2019, 363 : 17 - 26
  • [42] AIoU: Adaptive bounding box regression for accurate oriented object detection
    Wen, Nu
    Guo, Renzhong
    Ma, Ding
    Ye, Xiang
    He, Biao
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (01) : 748 - 769
  • [43] Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
    Wu, Jialian
    Song, Liangchen
    Wang, Tiancai
    Zhang, Qian
    Yuan, Junsong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1570 - 1578
  • [44] Objectness Consistent Representation for Weakly Supervised Object Detection
    Yang, Ke
    Zhang, Peng
    Qiao, Peng
    Wang, Zhiyuan
    Li, Dongsheng
    Dou, Yong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1688 - 1696
  • [45] Dual-branch mutual assistance network for salient object detection
    Yao, Zhaojian
    Wang, Luping
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (01) : 972 - 990
  • [46] Zhang M, 2020, MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, P4107, DOI [10.1145/33941713413969, 10.1145/3394171.3413969]
  • [47] GraphFPN: Graph Feature Pyramid Network for Object Detection
    Zhao, Gangming
    Ge, Weifeng
    Yu, Yizhou
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2743 - 2752
  • [48] EMO-MVS: Error-Aware Multi-Scale Iterative Variable Optimizer for Efficient Multi-View Stereo
    Zhou, Huizhou
    Zhao, Haoliang
    Wang, Qi
    Lei, Liang
    Hao, Gefei
    Xu, Yusheng
    Ye, Zhen
    [J]. REMOTE SENSING, 2022, 14 (23)
  • [49] Traffic-Sign Detection and Classification in the Wild
    Zhu, Zhe
    Liang, Dun
    Zhang, Songhai
    Huang, Xiaolei
    Li, Baoli
    Hu, Shimin
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2110 - 2118
  • [50] RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
    Zong, Zhuofan
    Cao, Qianggang
    Leng, Biao
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5637 - 5645