Multi-scale object detection by top-down and bottom-up feature pyramid network

被引:0
|
作者
ZHAO Baojun [1 ,2 ]
ZHAO Boya [1 ,2 ]
TANG Linbo [1 ,2 ]
WANG Wenzheng [1 ,2 ]
WU Chen [1 ,2 ]
机构
[1] School of Information and Electronics, Beijing Institute of Technology
[2] Beijing Key Laboratory of Embedded Real-time Information Processing Technology,Beijing Institute of Technology
基金
中国国家自然科学基金;
关键词
convolutional neural network(CNN); feature pyramid network(FPN); object detection; deconvolution;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [31] ZigZagNet: Fusing Top-Down and Bottom-Up Context for Object Segmentation
    Lin, Di
    Shen, Dingguo
    Shen, Siting
    Ji, Yuanfeng
    Lischinski, Dani
    Cohen-Or, Daniel
    Huang, Hui
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7482 - 7491
  • [32] A Bottom-Up and Top-Down Integration Framework for Online Object Tracking
    Li, Meihui
    Peng, Lingbing
    Wu, Tianfu
    Peng, Zhenming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 105 - 119
  • [33] Unsupervised Sounding Object Localization with Bottom-Up and Top-Down Attention
    Shi, Jiayin
    Ma, Chao
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2161 - 2170
  • [34] Multi-scale redistribution feature pyramid for object detection
    Qian, Huifang
    Guo, Jiahao
    Zhou, Xuan
    AI COMMUNICATIONS, 2022, 35 (01) : 15 - 30
  • [35] Multi-scale Pyramid Feature Maps for Object Detection
    Hao Huijun
    Ye Ronghua
    Chen Zhongyu
    Zheng Zhonglong
    2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, : 237 - 240
  • [36] TBINet: fabric defect detection based on a top-down and bottom-up inference network
    Liu, Zhoufeng
    Huang, Ning
    Guo, Zijing
    Gao, Chengli
    Li, Chunlei
    JOURNAL OF THE TEXTILE INSTITUTE, 2024, 115 (07) : 1106 - 1117
  • [37] Top-Down versus Bottom-Up Macroeconomics
    De Grauwe, Paul
    CESIFO ECONOMIC STUDIES, 2010, 56 (04) : 465 - 497
  • [38] Medezeggenschap: top-down en bottom-up
    Astrid van de Weijenberg
    Management Kinderopvang, 2016, 22 (1): : 24 - 26
  • [39] Mass spectrometry: Bottom-up or top-down?
    Chait, Brian T.
    SCIENCE, 2006, 314 (5796) : 65 - 66
  • [40] Lastprognose - Top-down oder bottom-up?
    Stephanblome, Th.
    Buhner, V.
    VDI Berichte, 2002, (1688): : 91 - 100