Multi-scale object detection by top-down and bottom-up feature pyramid network

被引:0
|
作者
ZHAO Baojun [1 ,2 ]
ZHAO Boya [1 ,2 ]
TANG Linbo [1 ,2 ]
WANG Wenzheng [1 ,2 ]
WU Chen [1 ,2 ]
机构
[1] School of Information and Electronics, Beijing Institute of Technology
[2] Beijing Key Laboratory of Embedded Real-time Information Processing Technology,Beijing Institute of Technology
基金
中国国家自然科学基金;
关键词
convolutional neural network(CNN); feature pyramid network(FPN); object detection; deconvolution;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network(TDBU-FPN),which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes(PASCAL VOC)dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [41] IMPLEMENTING DIVERSITY: TOP-DOWN AND BOTTOM-UP
    Lievens, S.
    ICERI2016: 9TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION, 2016, : 4650 - 4650
  • [42] Top-down bottom-up graphene synthesis
    Zhang, Zishuai
    Fraser, Alison
    Ye, Siyu
    Merle, Geraldine
    Barralet, Jake
    NANO FUTURES, 2019, 3 (04)
  • [43] Bottom-up and top-down attention are independent
    Pinto, Yair
    van der Leij, Andries R.
    Sligte, Ilja G.
    Lamme, Victor A. F.
    Scholte, H. Steven
    JOURNAL OF VISION, 2013, 13 (03): : 16
  • [44] Visual search: Bottom-up or top-down?
    Patel, GA
    Sathian, K
    FRONTIERS IN BIOSCIENCE, 2000, 5 : D169 - D193
  • [45] A VISUAL ATTENTION MODEL COMBINING TOP-DOWN AND BOTTOM-UP MECHANISMS FOR SALIENT OBJECT DETECTION
    Fang, Yuming
    Lin, Weisi
    Lau, Chiew Tong
    Lee, Bu-Sung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1293 - 1296
  • [46] Top-down and bottom-up quality management
    Hurst, Keith
    INTERNATIONAL JOURNAL OF HEALTH CARE QUALITY ASSURANCE, 2010, 23 (07) : 16 - 17
  • [47] Top-down and bottom-up research in biodemography
    Kaplan, Hillard
    Gurven, Michael
    DEMOGRAPHIC RESEARCH, 2008, 19 : 1587 - 1602
  • [48] Bottom-Up and Top-Down Graph Pooling
    Yang, Jia-Qi
    Zhan, De-Chuan
    Li, Xin-Chun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 568 - 579
  • [49] When bottom-up meets top-down
    Dittrich, W
    TRENDS IN COGNITIVE SCIENCES, 2001, 5 (04) : 137 - 137
  • [50] A top-down look at bottom-up electronics
    Lundstrom, M
    2003 SYMPOSIUM ON VLSI CIRCUITS, DIGEST OF TECHNICAL PAPERS, 2003, : 5 - 8